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(54) INFORMATION PROCESSING SYSTEM, ENCIPHERING/DECIPHERING SYSTEM, SYSTEM 
LSI, AND ELECTRONIC APPARATUS 



(57) An information processing system that is con- 
figured in such a manner that computational processing 
is performed on input data in accordance with a 
processing sequence, for outputting data, comprises: a 
plurality of arithmetic units (7-1 to 7-x), each computing 
at an arithmetic precision 2 m bits (where m is a natural 
number) based on the processing sequence; and a plu- 
rality of cascade connection terminals for cascading 
these arithmetic units each other. When the maximum 
arithmetic precision that is required during computa- 
tional processing is 2 n bits (where n is a natural number 
and is fixed), x numbers of (where x is a natural number) 
the arithmetic units are cascaded in a manner such that 
the inequality x £ 2 n /2 m is satisfied. When an arithme- 
tic precision of 2 n1 bits (where n1 < n, and n1 is variable) 
is necessary during computational processing, x1 num- 
bers of the arithmetic units are cascaded in a manner 
such that the inequality x1 £ 2 n1 /2 m (where x1 is a 
natural number and is variable) is satisfied. This makes 
it possible to easily implement an information process- 
ing system for performing computations to any desired 
precision in a hardware manner, and also makes it pos- 
sible to support a simple hardware-based method of 
expanding the arithmetic precision. 
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Description 
Technical Held : 

5 [0001 ] The present invention relates to an information processing system and an encryption/decryption system and, 
in particular, to an information processing system that is capable of performing computation to any desired precision 
due to the hardware configuration thereof and an encryption/decryption system that is capable of performing encryp- 
tions and decryptions due to the hardware configuration thereof. In addition, the present invention also relates to an 
information processing system, a system LSI, and electronic equipment that use an internal or external bus line for an 

10 optical transmission method. 

Background pf Art: 

[0002] When complicated and large-scale computations are to be performed, such as those for random number gen- 
ts eration, wavelet transformation, neural networks, fast Fourier transformation, and digital filters, considerations of devel- 
opment costs and time scales make it standard in the art to implement them by software, using general-purpose 
computation devices, instead of dedicated hardware which is only used in specific applications. 
[0003] The recent spread of the Internet has increased the necessity of information security techniques, from the 
viewpoints of electronic transactions and privacy protection, and cryptographic techniques have drawn attention as 
20 effective means of providing them. 

[0004] The main categories of cryptographic methods are private-key cryptosystem and public-key cryptosystem. 
where a typical private-key cryptosystem is the Data Encryption Standard (DES) and a typical public-key cryptosystem 
is Rivest-Shamir-Adleman criptograph (RSA) system. 

[0005] In principle, DES is a method of rearranging or substituting of a data bit string and RSA is a method of per- 
25 forming residue operations on an extremely large number of bits, so a public-key cryptosystem tends to be several hun- 
dred times slower than a private-key cryptosystem. This is because public-key cryptosystems perform residue 
computations to an extremely high precision, using at least several hundred bits as a modulus. 
[0006] In such a case, it has become common to use a cryptographic method in which a private-key cryptosystem is 
used to encrypt a large-volume data string and a public-key cryptosystem is used for delivering authentication, a signa- 
30 ture, and a key, which involve a small volume of data. 

[0007] With public-key cryptosystem, the encryption strength can be selected by varying the bit length of the key, so 
there is a demand for methods of enabling computations using public keys of various different numbers of bits that have 
been revealed to communications partners. 

[0008] If it is desired to exceed the arithmetic precision of a general-purpose computation device, when using a tech- 
35 nique of employing such a general-purpose computation device to implement computations by software, such as those 
used in random number generation, wavelet transformation, neural networks, fast Fourier transformation, and digital fil- 
ters, it is necessary to support everything in the software and thus it may become impossible to implement this in reality 
because of increases in programming labor and processing time. 

[0009] It is regulated that DES must be implemented by using hardware techniques. Therefore, it is in fact difficult to 
40 prevent third parties from breaking the ciphers of an encryption/decryption system created by software alone. If part of 
the encryption algorithm could be implemented by hardware, it will be possible to increase the strength of the encryp- 
tion. 

[0010] A single chip public-key cryptographic processor has already been proposed, similar to this public-key crypto- 
system (in the proceedings of the Electronic Information Communications Society, D-l, Vol. J80-D-I, No. 8, pp. 725-735, 
45 1997-8). 

[0011] However, although this single chip public-key cryptographic processor does enable processing at any desired 
precision within a precision range that is set beforehand by the hardware, processing at a precision in excess of that 
preset precision is not possible, raising the problem that new hardware must be designed therefor. 
[0012] If the computation device is configured of thin film transistors (TFTs) on a single chip, by way of example, the 
so presence of non-operating transistors could make the entire computation device unusable, raising problems of a reduc- 
tion in yield and also an increase in fabrication costs. 

[0013] As yet another technical concern, recent advances in microcomputers has promoted the miniaturization and 
value-added capabilities of appliances in various fields, but it has become extremely difficult to design such microcom- 
puters. Together with the push for increases integration of microcomputers formed of microprocessors (MPUs) on sin- 
55 gle chips, wiring widths are now of the submicron order and the resistance thereof is increasing, and also the stray 
capacities thereof have been increased by multi-layer configurations, so that distributed parameter circuits are formed 
in the wiring sections, which increased the propagation delays in electrical signals. 

[0014] This means that it is necessary to determine the instruction cycle of the central processing unit (CPU) within 
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the microcomputer, after considering the time required for the CPU to transmit data and addresses to other functional 
units and the lime required before the CPU can input data from the other functional units, which increases the design 
load caused by signal propagation delays. 

[0015} ' Recent trends in-miniaturization and additional functions have meant that progress has been only in the direc- 
5 tions pf VLSIs and higher levels of integrations, making the design of microcomputers even more difficult. 

[001 6] Increases in wiring resistance and capacity have not only produced signal propagation delays, they also cause 
deformation |of the rectangular waveforms of signals. There have been recent advances in lower power consumptions, 
particularly in portable electronic equipment, but if voltage levels become lower and the extent of deformation of the rec- 
tangular waveforms increases, it is no longer possible to ensure the functions of signals and erroneous operation can 
io occur. 

[001 7] An objective of the present invention is to provide an information processing system that enables a system that 
can compute to any desired precision, using a simple hardware structure. 

[001 8] Another objective of the present invention is to provide an encryption/decryption system that makes it possible 
to facilitate the construction of a system that is required to operate at any desired precision (any desired number of bits), 
is using hardware. 

[0019] A further objective of the present invention is to provide an information processing system and an encryp- 
tion/decryption system that make it possible to improve the effective yield when the system is configured on a single 
chip, and also improve the reliability of the system. 

[0020] Yet another objective of the present invention is to provide an information processing system and a system LSI, 
20 together with electronic equipment using the same, in which signal propagation delays are reduced to a degree such 
that changes in signal waveform can be ignored, and which can be designed simply. 

[0021] A still further objective of the present invention is to provide a system LSI, together with electronic equipment 
using the same, that make it possible to simplify the management of intellectual property rights when using the above 
encryption/decryption system in a simple manner, even when it incorporates functional blocks developed by other conv 
25 panies. 

Disclosure of Invention 

[0022] One aspect of the present invention relates to an information processing system in which computational 
30 processing is performed on input data in accordance with a processing sequence, for outputting data, the information 
processing system comprising: 

a plurality of arithmetic units, each computing to an arithmetic precision of 2 m bits (where m is a natural number), 
based on the processing sequence; and 
35 a plurality of cascade connection terminals for cascading the arithmetic units each other, 

wherein, when the maximum arithmetic precision that is required during computational processing is 2 n bits (where 
n is a natural number and is fixed), x numbers of (where x is a natural number) the arithmetic units are cascaded 
in a manner such that the inequality x £ 2 n /2 m is satisfied. 

40 [0023] This aspect of the invention simplifies the hardware processing by distributing the computational processing 
among a plurality of computational processing units, making it possible to configure a system in a simple manner and - 
increase the speed of computational processing, even when high-volume computations and large precisions are 
required for processing such as that for random number generation, wavelet transformation, fast Fourier transformation, 
and digital filters. As a result of this distributed processing, it is possible to restrain the number of gates of the arithmetic 

45 units, reducing device costs. 

[0024] In this aspect of the invention, when an arithmetic precision of 2 n1 bits (where n1 < n, and n1 is variable) is 
required during computational processing, x1 numbers of the arithmetic units are cascaded in a manner such that the 
inequality xl £ 2 n1 /2 m (where x1 is a natural number and is variable) is satisfied. In this case, a clock generation circuit 
generates 2 n1 numbers of reference clock pulses during computational processing. This ensures that x1 out of the x 

so numbers of arithmetic units can be cascaded efficiently. With this configuration, the power that would be consumed by 
the remaining arithmetic units is saved and computational speeds are also increased. If the configuration is such that 
optical signals are transmitted between the plurality of arithmetic units, the computational speeds can be further 
increased. 

[0025] Each of the plurality of arithmetic units may comprise y numbers of arithmetic modules (where y is a natural 
55 number and is fixed), each computing at an arithmetic precision of 2"^ when y numbers of the arithmetic modules are 
cascaded. 

[0026] When an arithmetic precision of 2 n1 bits (where n1 ^ n, and nl is variable) is required during computational 
processing, the y numbers of arithmetic modules may be cascaded in a manner such that the inequality y1 £ 2 n1 /2 
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(where y1 is a natural number and is variable) is satisfied. 

[0027] Alternatively, computations may be executed by (x1 - 1) numbers of cascaded arithmetic units at an arithmetic 
precision of 2 n2 (where n2 < n1) for performing computations at a 2 n1 -bit precision; and y1 numbers of arithmetic mod- 
ules in one other arithmetic unit, which is cascaded from the (x1 - 1) numbers of arithmetic units, may be cascaded in 
5 a manner such that the inequality y1 £ (2 n1 -2 n2 )/2 (where y1 < y, and y1 is variable) is satisfied. 

[0028] If the configuration is such that optical signals are transmitted between the maximum y arithmetic modules, the 
computational speeds within the arithmetic units can be further increased. 

[0029] This aspect of the invention could be further provided with a first storage section for storing a processing 
sequence. In addition, it could be further provided with computation control means for controlling the plurality of arith- 
10 metic units based on the processing sequence stored in the first storage section. In such a case, the optical signals may 
be transmitted between the plurality of arithmetic units and the computation control means. 

[0030] This aspect of the invention could further comprise a second storage section for temporarily storing computa- 
tion results of the plurality of arithmetic units, where optical signals are transmitted between the second storage section 
and the plurality of arithmetic units. 

75 [0031 ] To give an example of the computations performed by the plurality of arithmetic units, when the input data is X 
and N and output data is Y, Y = X 2 mod N is computed to generate a pseudo-random number. 
[0032] In this case, when the yield of the plurality of arithmetic units is A and the total number of arithmetic units that 
are available is K, a maximum number x (where x £ K/A) of the arithmetic units that are operative may be cascaded. 
[0033] Alternatively, when the yield of the arithmetic modules is A' and the total number of arithmetic modules pro- 

20 vided in each of the plurality of arithmetic units is L, a maximum number y (where y £ L/A*) of the arithmetic modules 
that are operative may be cascaded. 

[0034] Another aspect of the present invention relates to an information processing system in which computational 
processing is performed on input data in accordance with a processing sequence, for outputting data, the information 
processing system comprising: 

25 

a plurality of internal arithmetic units, each computing to an arithmetic precision of 2 m1 bits (where ml is a natural 
number and is fixed), based on the processing sequence; 

a plurality of external arithmetic units for computing to an arithmetic precision of 2 m2 bits (where m2 is a natural 
number and is fixed); and 

30 cascade connection terminals for cascading the plurality of internal arithmetic units and the plurality of external 
arithmetic units, 

wherein, when the maximum arithmetic precision required during computational processing is 2" bits (where n is a 
natural number and is variable), z numbers of the external arithmetic units are cascaded in a manner such that the 
inequality z ^ (2 n - 2 m1 )/2 m2 (where z is a natural number and is fixed) is satisfied. 

35 

[0035] This configuration makes it possible to expand the arithmetic precision in a simple hardware manner, ensuring 

the reliability of the system while simplifying the design of the information processing system. 

[0036] An encryption/decryption system in accordance with a further aspect of the present invention comprises: 

40 a plurality of power residue arithmetic units; and 

a plurality of cascade connection terminals for cascading the plurality of power residue arithmetic units, 
wherein each of the plurality of power residue arithmetic units comprises: 

a multiplication unit for performing a multiplication at a multiplication precision of 2 m bits (where m is a natural 
number and is fixed); and 
45 a division unit for performing a division at a division precision of 2 2xm bits, 

wherein, when the maximum arithmetic precision of a power residue computation executed by the plurality of power 
residue arithmetic units is 2 n (where n is a natural number and is fixed), x numbers of the plurality of power residue 
arithmetic units are connected together for encryption and decryption, in a manner such that the following inequal- 
ity is satisfied: 

50 

x £ 2 n /2 m (where x is a natural number and is fixed). 

[0037] This configuration simplifies the processing of the hardware by distributing the computational processing 
required for encryption or decryption among a plurality of computational processing units, making it possible to config- 
55 ure a system in a simple manner and increase the speed of computational processing, even when high-volume compu- 
tations and large precisions are required for the processing. As a result of this distributed processing, it is possible to 
restrain the number of gates of the arithmetic units, reducing device costs. 

[0038] An encryption/decryption system in accordance with a still further aspect of the present invention comprises: 
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a plurality of internal power residue arithmetic units for performing power residue computation at a multiplication 
precision of 2 m1 bits (where ml is a natural number and is fixed) and a division precision of 2 2xm1 bits; 
a plurality of external power residue arithmetic units for performing power residue computation at a multiplication 
precision of 2 m2 bits (where m2 is a natural number and is fixed) and a division precision of 2 2xm2 bits; and 
5 cascade connection terminals for cascading the pluralities of internal and external power residue arithmetic units, 

wherein, when encryption and decryption is performed at a maximum bit precision of 2 n (where n is a natural 
number and is fixed), z numbers of the external power residue arithmetic units are cascaded in a manner such that 
the following inequality is satisfied: 

10 z £ (2 n - 2 m1 )/2 (where z is a natural number and is fixed). 

[0039] This configuration makes it possible to extend the arithmetic precision for power residue computations in a sim- 
ple hardware manner, making it possible to ensure sufficient reliability for the precision of the key. 
[0040] In addition, since external power residue arithmetic units can be configured with a small number of gates, it is 
15 possible to simply construct an encryption/decryption system that can run on even a personal computer system. 

[0041] Yet another aspect of the present invention relates to an information processing system comprising a plurality 
of functional units, each operating based on multi-channel electrical signals, and a bus line for transmitting signals ther- 
ebetween, wherein: 

20 each of the plurality of functional units comprises a signal output section and/or a signal input section; 

the signal output section comprises electrical-optical signal conversion circuit for converting multi-channel electrical 

signals into multi-channel optical signals of different wavelengths, and outputting the same; 

the signal input section comprises optical -electrical signal conversion circuit for converting the multi-channel optical 

signals of different wavelengths into the multi-channel electrical signals; and 
25 the bus line is formed by an optical transmission medium. 

[0042] In this case, signal transmission between one functional unit and another functional unit is through a bus line 
that is an optical transmission medium, using multi-channel optical signals having separated wavelength. For that pur- 
pose, a signal output section of each functional unit comprises electrical-optical signal conversion circuit for converting 
30 multi-channel electrical signals into multi-channel optical signals, and a signal input section of each functional unit com- 
prises optical-electrical signal conversion circuit for converting multi-channel optical signals into multi-channel electrical 
signals. 

[0043] Since transmission between the functional units on the single-chip are optical, signal propagation delays can 
be virtually ignored because signal propagation is at the speed of light. As a result, there are no design restrictions 
35 caused by such signal propagation delays, making the design of the information processing system simpler. 

[0044] In this aspect of the invention, one of the plurality of functional units may be a central processing unit (CPU), 
and the bus line may comprise a data bus line and an address bus line. The data and address bus lines are both formed 
as optical transmission lines. 

[0045] In this case, one optical transmission medium may be used in common for the data bus line and the address 
40 bus line. Data and addresses could be transmitted not simultaneously, but in a time division manner. 

[0046] In this aspect of the invention, an optical input-output section for providing optical communications with periph- 
eral equipment is preferably connected to the bus line. With such a configuration, an optical signal that has been con- 
verted by the electrical -optical signal conversion circuit of one of the functional units within the information processing 
system can be sent without any further modification to the peripheral equipment and. conversely, an optical signal from 
45 the peripheral equipment can be input without any further modification to the information processing system. This also 
has the advantage that no noise is superimposed on optical signals and there is no need to remove noise from optical 
signals. 

[0047] The optical input-output section for providing optical communications with peripheral equipment preferably 
enables optical communications to and from the peripheral equipment, by a chip select signal transmitted from the cen- 
50 tral processing unit through the bus line. In such a case, an optical shutter or the like, which is operated by the chip 
select signal, could form the optical input-output section. 

[0048] A yet further aspect of this aspect of the invention relates to a system LSI configured of a first semiconductor 
device having a first internal bus line and a second semiconductor device having a second internal bus line, connected 
together by an external bus line, wherein: 

55 

the first semiconductor device comprises a first signal output section and a first signal input section, and a central 
processing unit (CPU) operating based on multi-channel electrical signals is formed on a first substrate, with the 
first internal bus line; 
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the second semiconductor device comprises a second signal output section and a second signal input section, and 
a controlled unit that is controlled by signals from the central processing unit and operates based on multi-channel 
electrical signals is formed on a second substrate, with the second internal bus line; 

each of the first and second signal output sections comprises electrical-optical signal conversion circuit for convert- 
5 ing multi-channel electrical signals into multi-channel optical signals having different wavelengths, and outputting 

the same; 

each of the first and second signal input sections comprises optical-electrical signal conversion circuit for convert- 
ing multi-channel optical signals having different wavelengths into multi-channel electrical signals; and 
the first and second internal bus lines and the external bus line are each formed by an optical transmission medium. 

10 

[0049] In case, for example, the first semiconductor device is the above described single-chip microcomputer, this 
configuration makes it possible for at least one functional unit that ought to be incorporated into the microcomputer to 
be incorporated into a second semiconductor device instead. In this case, the functional units within the first and second 
semiconductor devices can transmit signals to each other by light through the first and second internal bus lines and 

is the external bus line. Therefore, functional units that are on spatially separated substrates are connected by bus lines 
capable of communicating at the speed of light, so that this configuration is functionally equivalent to a microcomputer 
incorporating a plurality of functional units sending signals to each other by light within a single chip as described above. 
[0050] Thus some of the functional units that ought to be incorporated into a single chip can be placed externally 
thereto, which has the advantage of making it unnecessary to increase the integration. Alternatively, if it is desired to 

20 add arithmetic functions, memory or the like to a microcomputer, it is not necessary to re-design the single-chip micro- 
computer as in the conventional art, so that a basic microcomputer can be attached to a semiconductor device via an 
external bus line that is an optical transmission medium. This has the advantage of further increasing the universality 
of the microcomputer. 

[0051 ] In particular, application of the present invention makes it possible to freely modify the capacity of cache mem- 
25 ory. 

[0052] More preferably, the configuration is such that an internal clock on an internal bus line can be supplied to the 
external bus line. This makes it possible for an internal clock to act as a reference clock for all of the functional units, so 
that data can be processed in synchronized mode. 

[0053] Incorporating the above described information processing system or system LSI of the present invention is 
30 incorporated into electronic equipment makes it possible to support modifications of the functions of the electronic 
equipment. 

[0054] A final aspect of in accordance with this aspect of the invention relates to a system LSI incorporating a plurality 
of functional blocks developed by different manufacturers, wherein: 

35 at least one of the plurality of functional blocks comprises a four-arithmetical-operations function area; and 

the four-arithmetical-operations function area executes computations for decryption when a predetermined decryp- 
tion key has been input thereto, and a four-arithmetical-operations function other than the decryption is enabled 
after the decryption is established. 

40 [0055] Assume that this system LSI comprises a functional block that was developed by Company A, by way of exam- 
ple. A decryption key (private key) in accordance with the RSA method is disclosed by license to the end user who is 
using the functional block developed by Company A within this system LSI. Among the users of this system LSI, only 
end users who possess the decryption key can use this functional block. To use the functional block developed by Com- 
pany A in this configuration, it is necessary to acquire the decryption key from Company A by license, simplifying the 

45 management of intellectual property rights. 

[0056] The four-arithmetical -operations function area could also comprise x numbers of arithmetic units, each having 
an arithmetic precision of 2 m bits (where m is a natural number and is fixed), that are cascaded for computations. A por- 
tion of the plurality of arithmetic units could comprise a plurality of power residue arithmetic units that are cascaded. 

so Brief Description of Drawings 

[0057] 

Figs. 1 A and IB are schematic block diagrams of the configuration of an information processing system in accord- 
55 ance with a first embodiment of the present invention; 

Fig. 2 is a schematic block diagram of the programmable digital processor within Figs. 1 A and 1 B; 
Fig. 3 is a block diagram of a specific example of the pseudo-random number generation circuit of the first embod- 
iment; 
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Fig. 4 is a.schematic block diagram of an encryption/decryption system in accordance with a second embodiment 
of the present invention; 

Fig. 5 is a schematic block diagram of the configuration of a power residue arithmetic device; 
F^igJ 6 is a schematic block diagram of the configuration of an arbitrary -precision division device; 
5 Fjgs. 7A to 7C are diagrams illustrating the difficulties involved with division distributed processing; 

Fig. 8 is an illustrative view of the structure and operation of an arbitrary -precision divider; 
Fig. 9 is- a schematic block diagram of the configuration of a divider; 
Fig. 10 is a timing chart of the operation of the divider; 

Fig. 1 1 shows the processing sequence of encryption and decryption in the public-key cryptosystem based on the 
io RSA method in accordance with the second embodiment of the present invention; 

Fig. 12 illustrates the number of gates necessary for a power residue arithmetic device; 
Fig. 13A is an illustrative view of a technique for computing a b - M mod c; 
Fig. 13B is a block diagram of a circuit that enables the operation of Fig. 13A. 

Fig. 13C is an illustrative view of an arithmetic example for decrypting 186 19 = M mod 377 to obtain M = 17, in 
is accordance with the technique shown in Fig. 13A; 

Fig. 14 is an illustrative view of the calculation of the computation time necessary for the technique of Fig. 13A; 
Fig. 15 illustrates examples of computation times; 

Fig. 16 shows the processing sequence for encryption and decryption in accordance with a prior-art public-key 
cryptosystem based on the RSA method; 
20 Fig. 17 is a block diagram of a microcomputer in accordance with a fourth embodiment of the present invention; 

Fig. 18 is a schematic illustrative view of a region comprising the optical transmission sections of the microcom- 
puter of Fig. 17; 

Fig. 1 9 is a schematic illustrative view of light-emitting elements for three channels making up the light-emitting sec- 
tion of Fig. 18; 

25 Fig. 20 is a schematic illustrative view of an example of the fabrication of the light-emitting layer of Fig. 19; 

Fig. 21 is a schematic cross-sectional view of light-receiving elements for three channels making up the light-emit- 
ting section of Fig. 18; 

Fig. 22 is a schematic illustrative view of a modified example of the light-emitting section of Fig. 18; 
Fig. 23 is a schematic illustrative view of a system LSI in accordance with a fifth embodiment of the present inven- 
30 tion; 

Fig. 24 is a schematic illustrative view of a sixth embodiment of the present invention, in which light-emitting sec- 
tions, waveguides, and light-receiving sections are disposed three-dimensionally; and 

Fig. 25 is a schematic view of a system LSI in accordance with a seventh embodiment of the present invention. 

35 Best Mode for Carrying Out the Invention 

[0058] Preferred embodiments of the present invention are described below with reference to the accompanying fig- 
ures. 

40 First Embodiment 

[0059] A block diagram of an information processing system in accordance with a first embodiment of the present 
invention is shown in Fig. 1 A. 

[0060] This information processing system 10 is configured of a number of general-purpose microprocessors, such 
45 as two as shown by way of example in Fig. 1 A. In Fig. 1 A, this information processing system 10 is provided with a main 
microprocessor unit 1 , for controlling the entire system, and a programmable digital processor 3 that is connected to the 
main microprocessor unit 1 through a bus 2. The programmable digital processor 3 functions as a coprocessor for per- 
forming operations at high speed and to a desired arithmetic precision in accordance with a program (processing 
sequence) that is previously set into the main microprocessor unit 1 . 
so [0061] Depending on the specifications required of the information processing system 10. various external devices, 
that are not shown in the figure, such as a keyboard, mouse, or other input device; a printer or other output device; 
ROM, RAM, or other storage device; and a hard disk are connected to the bus 2. either directly or via interface devices. 
[0062] The configuration of the programmable digital processor 3 of Fig. 1 is shown in Fig. 2. This programmable dig- 
ital processor 3 has system ROM and system RAM (not shown in the figure), which are programmed by processing 
55 sequences loaded in advance from the main microprocessor unit 1 , to perform processing such as random number 
generation, wavelet transformation, fast Fourier transformation, or digital filtering. 

[0063] The programmable digital processor 3 is provided with a control unit 5 that controls the entire system, RAM 6 
and ROM 8 for storing various items of data and the processing sequence during the computations, x numbers of arith- 
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metic units 7-1 to 7-x (where x is a natural number and is fixed) that each operate at an 2 m -bit arithmetic precision 
(where m is a natural number and is fixed), switches SW for cascading these arithmetic units 7-1 to 7-x, an internal sys- 
tem bus 4A for transmitting address data and instruction data, and an internal data bus 4B for transmitting various types 
of data. Note that the internal system bus 4A and the internal data bus 4B are configured of an internal bus 4. Note also 

5 that the switches SW are shown in this figure for convenience in comprehending the configuration of the present 
embodiment in a graphical manner. In practice, the number of arithmetic units that are cascaded efficiently is deter- 
mined based on a factor such as the number of reference clock pulses generated from the control unit 5 within the pro- 
grammable digital processor 3, by way of example; the configuration is not such that switches are turned on and off 
mechanically. This point will be discussed later. 

10 [0064] In this case, if the maximum bit precision obtained when all of the arithmetic units are connected together as 
x cascades is assumed to be 2 n (where n is a natural number and is fixed), the maximum number x of cascade connec- 
tions must satisfy this inequality: 

x;>2 n /2 m 

15 

[0065] Note that, if the required arithmetic precision is a bit precision of 2 n1 (where n1 is a variable such that n1 < n) 
which is less than or equal to the maximum bit precision 2 n , it is possible to obtain results even when all of the x num- 
bers of arithmetic units 7-1 to 7-x are in an operation-possible state. However, x1 numbers of arithmetic units 7-1 to 7- 
x1 of the maximum number x of the arithmetic units 7-1 to 7-x could also be cascaded in a way that ensures that they 
20 operate efficiently, so long as the following inequality is satisfied: 

x 2> x1 ;> 2 n1 /2 m (where x1 is variable) 

It is therefore possible to plan for the most appropriate power consumption, without any unnecessary consumption of 
25 power, if operations are to be performed to a 2 n1 -bit precision, and thus further optimize the computation time. 

[0066] In this case, each of the arithmetic units 7-1 to 7-x has y numbers of arithmetic modules 9-1 to 9-y which are 
capable of performing at an arithmetic precision of 2 m/ y (where y is an integer greater than or equal to 2) and which can 
be cascaded through switches SW. Note that these switches SW are shown merely for convenience, to make the con- 
figuration of the present embodiment readily comprehensible in a visual manner. In practice, the number of arithmetic 
30 modules that are cascaded efficiently is determined based on a factor such as the number of reference clock pulses 
generated from the control unit 5 within the programmable digital processor 3, by way of example; the configuration is 
not such that switches are turned on and off mechanically 

[0067] If the arithmetic unit 7-1 and the arithmetic unit 7-2 are cascaded efficiently, for example, this is equivalent to 
cascading a total of 2y numbers of arithmetic modules in this case. 
35 [0068] It is also possible with the present embodiment to efficiently cascade y1 £ 2 n1 /2 (where y1 < y) numbers 
of arithmetic modules of all the y numbers of arithmetic modules 9-1 to 9-y within one arithmetic unit, which is sufficient 
for computing to a 2 n1 -bit precision. This makes it possible to further optimize the power consumption and computation 
time. 

[0069] Alternatively, it is possible to use (x1 - 1) numbers of arithmetic units 7-1 to 7-(x - 1) and y1 numbers of arith- 
40 metic modules within the arithmetic unit 7-x1 , and still compute to a 2 n1 -bit precision. If the (x1 - 1) arithmetic units 7-1 
to 7-(x - 1) are to compute to a 2 n2 -bit precision (where n2 < n1) with such a configuration, it is sufficient that the follow- 
ing inequality is satisfied: y1 ;> (2 n1 - 2 ^12 (where y1 <> y). 

[0070] The description now turns to the operation of the device shown in Figs. 1 A and 2. 

[0071] First of all, the main microprocessor unit 1 sends arithmetic instructions, the data necessary for the computa- 
45 tions, and arithmetic precision data corresponding to the required arithmetic precision to the programmable digital proc- 
essor 3 over the bus 2. 

[0072] This ensures that the control unit 5 of the programmable digital processor 3 controls the number of arithmetic 
units or arithmetic modules corresponding to the required arithmetic precision 2 n1 , from among the arithmetic units 7-1 
to 7-x, based on a processing sequence from the main microprocessor unit 1 and/or the RAM 6 (such as the encryption 
so sequence shown in Fig. 13B). The arithmetic units 7-1 to 7-x decide the numbers of units and modules that ought to be 
cascaded, based on the precision information of the arithmetic data. The numbers of units and modules that ought to 
be cascaded is determined by the number of reference clock pulses necessary for the computations. This reference 
clock signal is generated by a reference clock signal generation circuit (not shown in the figures) within the information 
processing system 10. 

55 [0073] This ensures that the numbers of arithmetic units and arithmetic modules necessary for the computations are 
cascaded efficiently, and the control unit 5 can use those arithmetic units for the computations, based on a processing 
sequence that is programmed in previously. 

[0074] The thus obtained computation results at a precision of 2 n1 are output to the main microprocessor unit 1 
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through the bus 2. 

[0075] In this case, multi-channel electrical signals could be used for the signals over the bus 2 between the chips of 
the first embodiment, in other words, between the main microprocessor unit 1 and the programmable digital processor 
3. or multi-channel optical signals having different wavelengths could be used therefor. Many techniques for providing 

5 optical communication between chips have already been proposed. 

[0076] Furthermore, optical signals can also be used in this first embodiment for transmissions between all or some 
of the components of the programmable digital processor 3 configured on a single chip. In other words, the internal sys- 
tem bus 4A and internal data bus 4B of the programmable digital processor 3 shown in Fig. 2 could be configured of an 
optical transmission path. This would make it possible for a control unit 105, the RAM 6. the ROM 8, and the x numbers 

10 of arithmetic units 7-1 to 7-x to transmit optical signals through the internal system bus 4A and the internal data bus 4B. 
In this case, if the configuration is such that information other than electrical signals, for example, magnetized informa- 
tion such as a ferroelectric memory, is stored as a storage element configured of the RAM 6, it is not necessary to con- 
vert optical signals into electrical signals. If ferroelectric memory is used, by way of example, information that has been 
magnetized based on optical signals could be stored. Furthermore, since the x numbers of arithmetic units 7-1 to 7-x 

is are substantially connected by a bus and the y numbers of arithmetic modules making up each of these arithmetic units 
are also connected by a bus, those buses can be used as optical transmission paths so that optical signals can be 
transmitted thereby between the arithmetic units and the arithmetic modules. 

[0077] Note that a specific example of the use of optical transmission paths for internal buses within a single chip will 
be described later. 

20 [0078] The description above concerned with a configuration in which there is a maximum of x numbers of arithmetic 
units, but it can also be provided with terminals for cascading external arithmetic units onto the programmable digital 
processor 3, to expand the functions of the system later. Thus, if the configuration is such that external arithmetic units 
can be cascaded therefrom, it will be possible to ensure an arithmetic precision that is greater than the arithmetic pre- 
cision obtained by the cascaded internal arithmetic units alone. 

25 [0079] More specifically, even if the programmable digital processor 3 is configured as a single chip, one or a plurality 
of programmable digital processors 3' could be cascaded as shown by the broken lines in Fig. 2 through the internal 
data bus and cascade connection terminals (not shown in the figure). This is equivalent to cascading the arithmetic 
units within the programmable digital processor 3' to the arithmetic units within the programmable digital processor 3'. 
This means that the arithmetic units within one or more programmable digital processors 3* function as external arith- 

30 metic units with respect to the internal arithmetic units 7-1 to 7-x within the programmable digital processor 3. It is also 
possible to transmit signals by optical signals between these internal arithmetic units and external arithmetic units. 
[0080] If the plurality of external arithmetic units is configured of, for example, thin film transistors (TFTs), there is a 
danger that there may be inoperative transistors due to factors such as damage, so that a maximum of z external arith- 
metic units can no longer be used. 

35 [0081 ] If the yield of the arithmetic units is A in such a case, it is preferable to ensure that at least K numbers of (where 
z £ K/A ) arithmetic units are provided in advance. This makes it possible to connect a maximum of z operative arith- 
metic units together out of the K numbers of arithmetic units. 

[0082] Since all of the arithmetic modules are the same, not just in the arithmetic units but also in the external arith- 
metic units, if the yield of external arithmetic modules is A', it is best to ensure that there are at least L (where y £ UPC 
40 ) arithmetic modules in each of the external arithmetic units. This makes it possible to connect y numbers of the arith- 
metic module that are operative among the L numbers of arithmetic modules. Therefore, if an arithmetic module is - 
found to be defective, that arithmetic unit can be handled as an operative unit by switching in one of the redundant arith- 
metic modules therein. 

[0083] In addition, to implement computations at a maximum arithmetic precision of 2 n , when internal arithmetic units 
45 within the programmable digital processor 3 that operate at an arithmetic precision of 2 m1 bits (where ml is a natural 
number) are cascaded with external arithmetic units within the programmable digital processor 3' that operate at an 
arithmetic precision of 2 m2 bits (where m2 is a natural number), z numbers of the external arithmetic units could be cas- 
caded in a manner such that the following inequality is satisfied: 

so z^(2 n -2 m1 )/2 m2 . 

[0084] If computations are to be at a desired bit precision of 2 n1 (where n1 is a natural number, and n1 < n), only the 
external arithmetic units that are necessary for the computations can be selected for operation by cascading z numbers 
of the external arithmetic units in a manner such that the following inequality is satisfied: 

55 

z £ z1 £ (2 n1 - 2 m1 )/2 m2 (where z1 is a natural number). 
As a result, it is possible to optimize the power consumption and also optimize the computation time. 
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[0085] Note that the main microprocessor unit 1 of the embodiment shown in Fig. 1A is an indispensable structural 
requirement for constructing the information processing system 10. However, if the configuration is such that the func- 
tions of the main ;microprocessor unit 1 are provided in the programmable digital processor 3 and a program is loaded 
from a storage device or an external storage device through the bus 2, the main microprocessor unit 1 could be made 

5 unnecessary, as demonstrated by an information processing system 10' shown in Fig. 1 B. 

[0086] The description above concerned a case in which the programmable digital processor 3 loads a program, but 
it should be obvious that the present invention is not limited thereto. For example, the program for the processing 
sequence could be stored beforehand in non-volatile memory such as a mask ROM, PROM, or EEPROM, making it 
possible for the operation to be based on a fixed processing sequence, so that the fixed processing sequence can be 

io implemented by hardware logic. 

[0087] The description now turns to an example in which the present invention is applied to a pseudo-random number 
generation device, as a specific example of the first embodiment. 

[0088] If N is a product of large prime numbers, it is possible to create a computationally secure pseudo-random 
number using a non-linear operator Y = X 2 mod N . In this case "computationally secure" means that breaking the 
15 security thereof (in other words, being able to use a partial random number string that is part of the generated random 
number string to guess another partial random number string that is another part of the generated random number 
string) is equal in computational complexity to factorizing N. 

[0089] That is to say, no simple method of calculating factors has been discovered during the long history of mathe- 
matics, so it is probably impossible to predict such a random number. 
20 [0090] A block diagram of the generation of a computationally secure pseudo-random number using Y = X 2 mod N 
is shown in Fig. 3. In this case: 

N = P xQ 

25 where P and Q are both large prime numbers. 

[0091] As shown in Fig. 3. a pseudo-random number can be generated to any desired precision during the computa- 
tional processing of Y = X 2 mod N from input data X and N, by using the programmable digital processor 3 of the first 
embodiment. 

[0092] As described above, the present embodiment of the invention makes it possible to easily implement an infor- 
30 mation processing system for performing calculations to any desired precision in a hardware manner, and also makes 
it possible to support a simple hardware method of expanding the arithmetic precision 

[0093] In the description of this first embodiment of the invention, the control unit 5 and the RAM 6 were provided 
within the programmable digital processor 3, but it is also possible to configure the information processing system in 
such a manner that these functions are performed by the main microprocessor unit 1 and the arithmetic units alone are 
35 fabricated on a single chip, to enable the system to function as an arbitrary-precision general-purpose processor. 

Second Embodiment 

[0094] The description now turns to a second embodiment in which the present invention is applied to an encryp- 
40 tion/decryption system which performed encryption and decryption processing. 

1. Principles of RSA Method 

[0095] Before concentrating on a specific embodiment, the description first concerns the RSA method, which is a typ- 
45 ical public-key cryptosystem. 

[0096] In one usage mode of network encryption, the sender uses an encryption key to encrypt and send message 
text, and the recipient of the encrypted text uses a decryption key to decrypt the received text and thus reconstitute the 
message text. 

[0097] In contrast to a private-key cryptosystem in which the encryption key and the decryption key are the same, 
so these keys are not the same in a public-key cryptosystem where the encryption key is published but the decryption key 
is kept secret. The principle of the RSA method, which is a typical public-key cryptosystem, will now be discussed. 
[0098] Assume that the message text is divided into blocks of a suitable size and a numerical value corresponding 
thereto is M. Assume also that prime numbers p and q are determined, the decryption key (private key) is kept secret, 
and the public-key (public key) is such that n and e satisfy the following relationships: 

55 

n = pq (1) 
gcd(e,(p-1,q-1)) = 1 (2) 
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[0099] In this case, the greatest common divisor (gcd) of e, (p - 1), and (q - 1) is 1 ; in other words, a positive number 
e which is prime to (p - 1) and (q - 1) respectively is appropriately determined. 

[0100] The sender uses the public key to perform residue computations on message text M to the power of e. using 
n as modulus (in other words, to derive the remainder C when M e is divided by n) as shown in Equation 3, and sends 
s encrypted text C. 

M e eCmodn (3) 

[01 01 ] The recipient who has received the encrypted text C first uses the private key to find the reciprocal d of e, using 
io (p - l)(q - 1) as modulus, as shown in Equation 4. 

ed - 1 mod(P- 1)(q- 1) (4) 

[0102] The recipient then deciphers the encrypted text C into the message text M, using the thus obtained d, from 
75 Equation 5. 

C d M mod n (5) 

[0103] Incidentally, if Equations 4 and 5 use the lowest common multiple (lem) (p - 1, q - 1) of (p - 1) and (q - 1), d can 
20 be set to be small, which leads to a reduction in the computational complexity as shown in Equations 6 and 7. 

gcd(e, lcm(p-1,q-1)) = 1 (6) 

ed = 1 mod lem (p - 1 , q - 1) (7) 

25 

[0104] In this case, since the public key n is the product of private keys p and q, it is usually selected to be at least 
512 bits long, so as to disable easy prime factorization thereof. 

[0105] The RSA method described above is widely acknowledged to be a method with a high level of encryption 
strength, because of factors such as the use of the mathematical basic known as prime factorization and multiple-pre- 
30 cision residue computations. 

2. Example of Residue Computations 

[0106] To ensure that the power residue computation results of Equations 3 and 5 form encrypted/decrypted text, it 
35 is necessary to perform integer computations without using any rounding or floating-point arithmetic. Power residue 
computations enable this system to divide the exponent into portions that can be used for division, to obtain smaller res- 
idue computation products. To simplify the description, extremely small numerical values are used below. 
[0107] Assume that n = 55 and e = 7 for the public key and p = 5 and q = 1 1 for the private key. 
[0108] If the message text M is assumed to be 3, the encrypted text C is obtained by using Equation 3 with n and e 
40 of the public key, from: 

3 7 - Cmod 55 

giving: 

45 

C = 42 

[0109] The value of d is selected from p and q of the private key, for decryption. 
[01 1 0] From Equation 4: 

50 

76 - 1 mod (5- 1)(11 - 1) 

in other words, from: 
55 76 - 1 mod 40 

the following is obtained: 
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d = 23 



[0111] From: ' 



5 J. ' . • 42 23 » M mod 55 



the following is obtained: 



70 

[0112] When: 



M = 3 



X m r1 mod n, 

75 Y - r2 mod n 

in the residue computations, the multiplication of Equation 8 is established, using n as modulus. 

X.Y = r1 x r2 mod n 

20 

This relational expression ensures that the exponent portion is divided into portions that can be operated upon. 
[0113] For example, 

25 = 42 5 x 42 5 x 42 5 x 42 5 x 42 3 

[0114] In this case, the two residues 42 5 and 42 3 are obtained first, as follows: 
30 42 5 - 12 mod 55 

42 3 - 3 mod 55 

[01 1 5] From Equation 8: 

35 

42 23 s 12 x 12 x 12 x 12 x 3 . r mod 55 

f T t T T 
40 rl r2 r3 r4 r5 

[0116] The residue r 1-2 is then obtained from r1 x r2. 
45 [0117] From: 

r1 x r2 e 12 x 12 » r v2 mod 55 

the following is obtained: 

50 

r i-2 =34 

[01181 In a similar manner, from: 
55 r v2 x r3 = 34 x 12 s r 1 . 3 mod 55 

the following is obtained: 
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' 1-3 -23 

Furthermore, from: 

iii « 

5 r V3 x r4 « 23 x 12 - r v4 mod 55 

the following is obtained: 



Even further, from: 

r M xr5=1x3^r mod 55 

75 the following is obtained: 

r = 3 

[011 9] It is sufficient that the maximum precision of the residue multiplications in the above process of repeated res- 
20 idue computations is two times or less the modulus n. It is therefore possible to derive the residue that is the objective, 
by repeating residue computation substituted into small numerical values. 

3. Generic Encryption and Decryption Techniques 

25 [0120] The description now turns to generic techniques used for encryption and decryption in accordance with the 
RSA method, with reference to Fig. 16. 

[0121] First of all, the values of A, m, and n are input for the residue computation relating to A m - R mod n (step S1). 
[0122] For encryption with the congruent expression A m - R mod n, the message text M is A, m is the public key e, 
and the value of R obtained by residue computations using the public key n as modulus is the encrypted text C. For 
30 decryption, the encrypted text C is A, the value of d selected from the public key e and the private keys p and q is m, 
and the value of R obtained by residue computations using as modulus is the message text M. 
[0123] First of all, the number of bits K1 in A is obtained (step S2), then the maximum exponent N of A N in the residue 
computation system is obtained from the number of bits K1 in A (step S3). 

[0124] The sequence then determines whether or not the maximum exponent N is 1 (step S4) and, if the maximum 
35 exponent N is 1 , a precision overflow is determined to have occurred at A 2 in which case: 

A m r mod n 

is obtained (step S5), the number of bits k in the residue r is obtained (step S6), and the block size of A is determined 
40 from the number of bits k (step S7). 

[0125] If the block size of A is within the precision range (step S7: No), multiplications taking n as modulus are per- - 
formed on m numbers of A, and the residue R is derived from these results (step S8). 

[01 26] It the block size of A is outside the precision range of the system (step S7: Yes), it is determined that the data 

is too large and has caused an overflow (step S9), so the block size of A is made to be smaller and the computational 
45 processing ends (step S10). 

[0127] If, on the other hand. N is greater than or equal to 2 (step S4: No), m is split into qN + a (step S1 1) and, if a is 

not zero (step S1 2: Yes), A a is obtained (step SI 3) then the remainder rl when A a is divided by n is obtained (step S14). 

[0128] The sequence then determines whether or not q = 0 (step S15) and, if q = 0 (step S15: Yes), r1 is determined 

to be the residue R (step S16). 
so [0129] If it is determined in step S1 5 that q * 0 (step S1 5: No), A N is obtained (step S1 7) then the residue r when A N 

is divided by n is obtained (step S18). 

[01 30] This residue r is substituted into rOO (step S19), multiplications taking n as modulus are performed on q blocks 
of A N (step S20), and, if a * 0 (step S21 : No), the residue obtained by dividing the product of the thus obtained multipli- 
cation result r and r1 by n is obtained (step S22). the thus obtained residue r is assumed to be the residue R that is to 
55 be obtained finally (step S23), and the residue R is output as residue data (step S24). 

[01 31] This encryption and decryption processing (residue computation) of the public-key cryptosystem based on the 
RSA method is complicated. Particularly with the encryption/decryption system, because of the need for a high level of 
precision in which n is at least 512 bits long and it is impossible to distribute the divisions as previously mentioned, it is 
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surmised that it wilt be difficult to implement hardware in accordance with the first embodiment. And the programming 
of the above described processing as software is not simple and will require a great deal of computation time. 

4. Configuration of Public-key Cryptosystem 

5 

[01 32] A block diagram of a public-key cryptosystem in accordance with the second embodiment of the present inven- 
tion is shown in Fig. 4. 

[0133] The main components of a public-key cryptosystem 1 1 are a personal computer system 12 and a power resi- 
due arithmetic device 13 that is connected to the personal computer system 12 by an input-output interface 17. 
10 [0134] The personal computer system 12 is provided with a keyboard 15 for inputting various data items, a display 16 
for presenting various display items, the input-output interface 17, and a personal computer 18 incorporating an exter- 
nal storage device such as a hard disk (not shown in the figure). 

[0135] The main components of the power residue arithmetic device 13 are an arbitrary-precision division device 20 
that divides at any desired precision within a predetermined precision range and an arbitrary-precision multiplication 
75 device 21 that multiplies to any desired precision within a predetermined precision range, connected together by a bus 
22. 

5. Configuration of Arbitrary-Precision Division Device and Arbitrary-Precision Multiplication Device 

20 [01 36] The arbitrary-precision division device 20 is provided with H numbers of 8-bit dividers, a first divider 25-1 to an 
Hth divider 25-H (where H is an integer greater than or equal to 2), to which dividend data and divisor data is input; and 
(H - 1) numbers of switches, a first switch SW1 to an (H - 1)th switch SW(H - 1), for cascading the 8-bit dividers in 
accordance with precision information data; as shown in Fig. 6, with the configuration being such that the necessary 
number of dividers can be cascaded by the first to (H - 1)th switches SW1 to SW(H - 1) based on the precision informa- 

25 tion data. 

[0137] Note that these switches SW1 to SW(H - 1) are shown merely for convenience, in a similar manner to Fig. 2. 
The configuration can also be such that all transmissions of data between the 8-bit dividers and into and out of the bank 
of 8-bit dividers is done by optical signals. 

[0138] More specifically, if a division at 8-bit precision is to be performed, all of the switches SW1 to SW(H - 1) are 
30 placed in an open state and the first divider 25-1 alone operates; if a division to 1 6-bit precision is to be performed, the 
first switch SW1 alone is placed in a closed state and the first divider 25-1 and the second divider 25-2 operate; and if 
a division at (8 x J) precision is to be performed (where J is an integer greater than or equal to 2 but less than or equal 
to (H - 1)), the first switch SW1 to the (J - 1)th switch SW(J - 1) are all placed in a closed state and the first divider 25- 
1 to the Jth divider 25-J operate. 
35 [0139] The dividend and divisor data strings are distributed in sequence between the first divider 25-1 to the Jth 
divider, in 8-bit units starting from the lowermost 8 bits. 

[0140] In a similar manner, it is possible to fabricate an arbitrary-precision power residue arithmetic device wherein a 
plurality of dividers are configured as external power residue arithmetic units having a multiplication precision of 2 m bits 
(where m is a natural number and is fixed) and a division precision of 2 2m bits; each power residue arithmetic unit is 
40 provided with cascade connection terminal for cascading input signal lines for dividend data, divisor data, and precision 
information data as well as output signal lines for quotient data and residue data; and encryption and decryption can be 
performed to a maximum 2 n -bit precision by connecting x numbers of the power residue arithmetic units in a manner 
such that the following equation is satisfied: 

45 x = 2 n /2 m (where n is a natural number and x is a natural number). 

[0141] Note that the configuration of the arbitrary-precision multiplication device 21 is substantially the same as that 
of the arbitrary-precision division device 20, except that the 8-bit dividers are replaced by 8-bit multipliers, so further 
description thereof is omitted. In addition, if the bus 22 shown in Fig. 4 is used as an optical transmission path, data 
so transmission between the arbitrary-precision multiplication device 21 and the arbitrary-precision division device 20 can 
be done by optical signals. 

6. Divider Configuration 

55 [0142] The description now turns to the configuration of a divider constructed from arbitrary-precision division devices 
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6.1. Divider Algorithm 

[0143] Before dealing with the divider itself, the description first relates to the algorithm used for division. 

[0144] Many algorithms have been proposed for division, although not as many as for multiplication. In addition, it is 

s not as if divisions are used as often as multiplications in signal processing. 

[0145] However, when divisions are necessary in applications such as encryption/decryption used as a concealment 
technique in signal regularization, computer communications, and information networks, in feature extraction within 
speech recognition, for computations of linear prediction coefficients using a frequency spectrum as a basis (70 sum- 
of -products computations and 12 divisions), and in fault-tolerant systems, they do provide an effective means in the art 

10 for achieving results rapidly and at a high precision, although their implementation is not as simple as multiplications. 
[0146] The following four methods are known for implementing such divisions: 

(1) A "reciprocal ROM method" whereby a look-up table listing the relationships between variables x and their recip- 
rocals 1/x is provided in ROM, and a division such as y/x is executed by substituting the multiplication y(1/x). 
is (2) Instead of executing the division y/x directly, a "logarithm computation method" whereby the values of log x and 
log y are obtained for x and y from a logarithm ROM, the subtraction z = log x - log y is performed, and finally the 
value of exp z is obtained from an exponent ROM. 

(3) A "subtraction shift method" whereby shifts and subtractions are repeated with respect to the dividend, by trial- 
and-error. 

20 (4) A "convergent division method" whereby the value of the quotient to be obtained is converged upon as multipli- 
cations are repeated. 

[0147] Of these methods, the reciprocal ROM method is the fastest, but it has a disadvantage in that, if a high preci- 
sion is required, the capacity of the ROM that is to hold the look-up table increases exponentially. It is therefore used in 
25 signal processing only when a comparatively coarse precision is sufficient. 

[0148] The logarithm computation method also necessitates use only after sufficient investigation of the precision, 
which is restricted by the capacity of the logarithm ROM, which means that it is difficult to call it a general-purpose 
method that can be used in any situation. 

[0149] Of the above four methods, the subtraction shift method and convergent division method enable divisions to 
30 any desired precision. 

[0150] The convergent division method views the dividend and divisor as the numerator and denominator of a fraction, 
multiplies the numerator and denominator by the same convergence coefficient until the denominator approaches 1 , 
then takes the value of the thus obtained final numerator as the quotient. Alternatively, it can be used to obtain the recip- 
rocal of the divisor by a convergence algorithm, then obtain the quotient by multiplying this reciprocal by the dividend. 

35 [0151] However, when an expansion of the division capability by connecting together a plurality of chips (modules) is 
considered, this convergent division method that performs a convergence by approximations causes problems in that 
the propagation of partial residue involves further approximations and thus large errors can occur. 
[0152] The subtraction shift method is the most suitable, in that it does not have any of the above problems, it can 
divide at any desired precision, and the division capability can be expanded by connecting together a plurality of chips 

40 (modules). 

[01 53] The discussion now concerns a function for expanding the division capability. 
[0154] To perform the following division, for example: 
567890/1234 

subtractions of the divisor from the dividend are repeated, as shown in Fig. 7A. 

45 [0155] If it is considered to use two-digit-divisor dividers (or division modules) where the divisor has been expanded 
to four digits in this case, the divisor 1234 would be split into a divisor 12 and a divisor 34 and also the dividend 567890 
would be split into a dividend 567 and a dividend 890, as shown in Figs. 7B and 7C. But it is not possible to obtain an 
expanded solution of the division (equivalent to Fig. 7A) from the results of the distributed processing using the divisors 
12 and 34, unlike with multipliers (this is possible with multiplications). 

50 [0156] Therefore, division is effective only within the range of the usable number of digits, and thus it is simply impos- 
sible to use this method if the division capability is exceeded by even one bit, no matter how much memory or external 
circuitry is used. 

[0157] In addition, rounding is usually employed in division, but this cannot be used in an encryption/decryption sys- 
tem. 

55 [0158] For the above reasons, hardware-implemented dividers are handled in the art by software that has slow 
processing speeds. 

[0159] However, this divider algorithm is based on the "subtraction shift method" whereby shifts and subtractions are 
repeated in a trial-and-error manner with respect to the dividend, so the generic basic equation thereof can be 
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expressed by the following recursive equation: 

; " R (0) /D=X2- 1 q l + (2 n R (n) )/D 

[0160J the quotient Q is 
10 2 2- l q, 



and the final residue R is 2' n R< n ). 
is [0161] The digit qj +1 of the quotient in each computation step is determined by the following size relationship between 
2R and D: 

If 2R® < D:q K1 =0 

20 If 2R a) ;> D:q^ = 1 

[0162] In other words, a temporary partial residue R^ 1 * (= 2R^ -D) is first obtained then, if the partial residue RG +1 > 
is positive or zero. q j+1 = 1 and a negative partial residue is assumed. If the partial residue R0 +1) is negative, qj +1 = 0 
and the subtraction of the temporary partial residue is canceled. 
25 [0163] The previous recurrence formula describes that the operations above are obtained by shifting R< n > by n digits 
to the right. 

[0164] However, the above recursive equation is applied simply to an n-digit quotient. When division data is dispersed 
between modules, the dividend and divisor of each module must be in a concatenated state. 

[0165] In general, the dividend has a larger number of digits than the divisor, so it becomes difficult to simply imple- 
30 ment subtractions from the divisor by concatenating a distributed divisor because of concatenation of the divisors and 
concatenation of the subtracters and latch circuits accordingly. 

[01 66] In contrast thereto, the divider of the present embodiment makes it possible to distribute computations between 
modules by having distributed divisors and subtracters that are fixed within each module, concatenating the dividend, 
and performing shifts and subtractions thereon. 
35 [01 67] In the calculations in this case, the divisor is conversely fixed to a calculation by writing and the dividend is left- 
shifted for subtractions and other operations. 

[0168] To take a simple example, the steps of the operation when a single module is expanded to two modules by 4- 
bit dividers is shown in Fig. 8. 

[0169] The divisor distributed between the two modules is concatenated by digit-borrow terminals B0 and BN of the 
40 subtracters and latch circuit. The dividend is concatenated by terminals Q1 and Q0 and is returned to an initial-stage 
R1 . With the divisor sections fixed within each module, the dividend is left-shifted by the necessary number of digits and 
subtractions are repeated therefrom, to end the computations. 

[0170] The quotient q and the partial residue are determined by the subtraction results obtained in each subtraction 
step. Note that if 8-bit subtracters are used for this operation, it will be completed at the seventh step and the eighth step 
45 merely stores the quotient qO. As a result, quotient data Q is obtained in a dividend latch/shift circuit 32 that will be 
described later (see Fig. 9) and residue data is obtained in a partial residue shift circuit 34 will be described later (see 
Fig. 9). 

6.2. Schematic Configuration of Divider 

50 

[0171] A schematic block diagram of the configuration of a divider is shown in Fig. 9, taking the first divider 25-1 as 
an example. 

[01 72] The first divider 25-1 is provided with a divisor latch circuit 31 that latches 8-bit divisor data that is input thereto, 
the dividend latch/shift circuit 32 that latches dividend data and performs a shift operation thereon, a parallel subtraction 
55 circuit 33 that subtracts a divisor corresponding to the divisor data from a dividend corresponding to the dividend data, 
the partial residue shift circuit 34 that stores partial residue data and performs a shift operation thereon, and a control 
circuit 35 that generates shift pulses necessary for the division. 
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6.3. Operation of Divider 

[0173] The operation of a divider will now be described with reference to Fig. 10. 

[01741 ' At a time t1 , a division start pulse /STAT (where V indicates "low is active" herein) falls and the first divider 25- 

s 1 is initialized until the next rise of the division start pulse /STAT (at a time t2). 

[0175] At the first time t3 that a reference clock signal CK falls after the time t2, dividend data (8 bits) is latched into 
the dividend latch/shift circuit 32 and divisor data (8 bits) is latched into the divisor latch circuit 31 . 
[0176] At a time t4 at which the reference clock signal CK next rises, a subtraction is performed between the dividend 
and the divisor in the parallel subtraction circuit 33. If the subtraction result is positive or zero, a switch 36 is turned 

io towards the parallel subtraction circuit 33 side; if the subtraction result of the parallel subtraction circuit 33 is negative, 
the switch 36 is turned towards the partial residue shift circuit 34 side. 

[0177] As a result, at a time t5, the partial residue data in the partial residue shift circuit 34 is shifted and, at the same 
time, either the partial residue data that is the subtraction result of the parallel subtraction circuit 33 or the partial residue 
data that is the pre-subtraction data stored in the partial residue shift circuit 34 is latched into the partial residue shift 
is circuit 34. 

[0178] In a similar manner, the subtraction of the divisor from the dividend and the shift-and-latch operations of the 
partial residue shift circuit are repeated between the time t5 and a time t6 for the number of times necessary for that 
dividend, in other words, seven times corresponding to the remaining seven bits if the complete dividend is eight bits 
long. 

20 [0179] When the reference clock signal CK falls at the time t6, the division end signal /DEND goes low and the oper- 
ation of the upper-digit subtracter (in this case, the second subtracter 25-2) starts in a similar manner. 
[0180] Similar divisions are performed in the upper-digit subtracters 25-3 to 25-J, and the subtractions end when the 
division end signal /DEND of the final subtracter 25-J is input to the first subtracter 25-1 . 

[0181] As a result, the quotient data (8-bit) is obtained in the dividend latch/shift circuits 32 of the subtracters 25-1 to 
25 25-J and the residue data is obtained in each of the partial residue shift circuits 34. 

[01 82] Since all of the above operations are performed in synchronization with shift pulses SLA output from the control 
circuit 35, there are no timing-related errors due to propagation delays, even when the division capability is expanded 
by cascading a large number of subtracters. 

30 7. Encryption and Decryption Processing (Residue Computation) 

[0183] The encryption and decryption processing (residue computation) of the public-key cryptosystem based on the 
RSA method will now be described, with reference to Fig. 1 1 . 

[0184] The values of A, N, and n are input to this encryption/decryption system (step S31) for the computation of: 

35 

A N - Amodn 

[0185] Next, A is substituted into A1 (step S32) and the initial value of a computational variable k is set to 1 (step S33). 
[0186] The computations of the subsequent steps S34 to S38 are repeated while the computational variable k 
40 remains below N, in other words, while the following inequality is satisfied (step S39): 

k< N. 

[0187] The sequence first determines whether or not N = 1 (step S34). 
45 [0188] If it is determined in step S34 that N = 1 , A1 is substituted into AO (step S35). 

[0189] If N * 1 , A1 and A are multiplied, the multiplication result is substituted into AO (step S36), AO = A mod n is 
obtained (step S37), and the computational variable k becomes k + 1 (step S38). 

[0190] When the computational variable k becomes greater than N, the thus obtained residue A becomes the target 
residue. During this time, the maximum precision of the multiplications is twice the number of bits in A. 
so [0191] Thus use of the power residue arithmetic device of this first embodiment makes it possible to greatly simplify 
the processing, in comparison with the prior-art processing of Fig. 16. 

8. Number of Gates of Power Residue Arithmetic Device 

55 [0192] The description now turns to the number of gates necessary for configuring the power residue arithmetic 
device in this case. 

[0193] It is well know that, since residue computations in an encryption/decryption system are done to an extremely 
high level of precision, the multiplication precision of a multiplier in one module is 16 bits and the division precision of a 
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divider in one module is 32 bits, so the number of gates when four modules are incorporated within one unit (to give a 
variable precision of 16 to 64 bits for multiplication and 32 to 128 bits for division) is" as shown in Fig. 12. 
[0194] Fig. 1 2 shows that, 1 700 gates are necessary for configuring a 32-bit precision divider of one module, so that 
the number of gates necessary for the arbitrary-precision division devices within one power residue arithmetic device is: 

5 

1700 x 4 = 6800 (gates). 

[0195] Fig. 12 also shows that 1032 gates are necessary for configuring a 16-bit precision multiplier of one module, 
so that the number of gates necessary for the arbitrary-precision multiplication devices within one power residue arith- 
10 metic device is: 

1032 x 4 = 41 28 (gates). 
[0196] Therefore, the total number of gates required for the power residue arithmetic device is: 

is 

6800 + 4128 = 10928 (gates). 

[0197] This makes it possible to place one power residue arithmetic device on a single chip with approximately 12000 
gates, including those required for the control circuit and dynamic connection circuit. 
20 [0198] Note that eight units are necessary for 512-bit precision and 16 units are necessary for 1024-bit precision. 

9. Computation Time 

[0199] The description now concerns the time required for computing a b « M mod c using the above described power 

25 residue arithmetic device, with reference to Figs. 13Ato 15. 

[0200] If the exponent b is n bits long, the computational processing necessary for the power residue computation 
can be broken down into the four multiplication and division steps indicated by (1) to (4) in Fig. 13A. In this case, the 
multiplications of steps (1) and (3) are done by an i-bit multiplier 21 shown in Fig. 13B and the divisions of steps (2) and 
(4) are done by a 2i-bit divider 20 shown in Fig. 13B. The multiplication precision of the multiplications of steps (1) and 

30 (3) is less than or equal to the maximum number of bits i in c, and the product thereof has a precision of twice the 
number of bits i in c, at most. Similarly, the division precision of the divisions (residue computations) of steps (2) and (4) 
is twice the number of bits i in c, at most. The i-bit multiplier 21 and the 2i-bit divider 20 have the same configuration as 
the arbitrary-precision multiplication device 21 and the arbitrary-precision division device 20 of Fig. 5. 
[0201] Note that the input values in Fig. 136 are a, b, and c, and the register for storing M is initialized to an initial 

35 value of 1 for M. The values of a and M are input to the i-bit multiplier 21 for the computation of M <- M x a mod c (when 
b = 0), and the remainder when the result is divided by c is stored as M. For the computation of a <- a 2 mod c (when b 
= 1), a 2 is obtained by inputting two values of a to the i-bit multiplier 21 . 

[0202] In this case, the computations of r1 2 , r2 2 , r3 3 3,... at (1) in Fig. 13A are done by the i-bit multiplier 21 , and the 
remainders when r1 2 , r2 2 ,... are each divided by c at (2) in Fig. 13A are obtained by the 2i-bit divider 20 as intermediate 
40 values of M. At (3) in Fig. 13A, the multiplications k1 = r1 x r2 , k2 = R0 x r3 , k3 = R1 x r4 .... are done by the i-bit mul- 
tiplier 21 . Then the remainders when K1 , K2,... are each divided by c are obtained as R0, R1 .... (intermediate values of 
M) by the 2i-bit divider 20, and the remainder Rn - 3 obtained by dividing kn - 2 by c in the final stage becomes the target 
value of M. 

[0203] An actual example of these computations is shown in Fig. 13C. Fig. 13C shows the calculation sequence for 
45 obtaining M = 17 by decrypting 186 19 » M mod 377. 

[0204] In Fig. 13C, the residue values "289", "204", "146", and "204" are each obtained as remainders when 186 2 , 
289 2 , 204 2 , and 146 2 are divided by 377. The products of (3) are obtained as 53,754 = 186 x 289 and 44,880 e 220 x 
204. Then the value M =17 of (4) is obtained as the remainder when 44,880 is divided by 377. 

[0205] If the maximum computation time required when all of a, b, and c in Fig. 13A have 1024 bits is expressed as 
so the number of reference clock pulses that is necessary therefor, 1 ,047,552 clock pulses are necessary for the multipli- 
cation time in (1), 2,095,104 clock pulses are necessary for the residue computation time in (2), 1 ,047,552 clock pulses 
are necessary for the multiplication time in (3), and 2,095,104 clock pulses are necessary for the residue computation 
time in (4), making a total of 6,285,312 clock pulses, as shown in Fig. 14. 

[0206] This means that the computation time is 95.2 ms when the reference clock frequency is 66 MHz. 
55 [0207] Similarly, the computation time when a and b each have 1 024 bits and the number of bits in the exponent b is 
512 bits or 9 bits is 47.6 ms or 0.7 ms, respectively, as shown in Fig. 15. It should be noted, however, that these com- 
putation times are results derived when each product and residue is obtained to the maximum precision, so in practice 
these values will be smaller. 



18 

BNSDOCID: <EP 097491 3 A1_1_> 



EP 0 974 913 A1 



[0208] This second embodiment of the invention makes it possible to configure a power residue arithmetic unit that 
can perform power residue computations at a multiplication precision of 2 m bits (where m is a natural number) and a 
division precision of 2 2xm bits, and encryption and decryption at a 2 n -bit precision can be performed by connecting x 
numbers of these power residue arithmetic unit in a manner such that the following equation is satisfied: 

5 ... 

x = 2 n /2 m (where n is a natural number and x is a natural number). 
[0209] It is therefore possible to expand the system in a simple hardware manner. 

[021 0] Note that the above relational expression of x is that when the maximum arithmetic precision of the power res- 
io idue arithmetic unit is equal to the arithmetic precision that is required, but the same effects can be obtained by adding 
a margin to this computational capability when x power residue arithmetic units are connected together in a manner 
such that the following inequality is satisfied: 

x > 2 n /2 m (where n is a natural number and x is a natural number). 

75 

[021 1] In addition, if it is required to have a precision 2 n1 (where n1 is a natural number, and also, n1 < n) that is lower 
than the maximum possible bit precision 2 n of the power residue arithmetic unit, the power consumption can be opti- 
mized by not supplying power to power residue arithmetic units that are not necessary for the computations, and also 
the computation time can be optimized, by cascading x1 numbers of power residue arithmetic units in a manner such 
20 that the following inequality is satisfied: 

x £ x1 ^ 2 n1 /2 m (where x1 is a natural number). 

[0212] It is also possible to configure the system of y numbers of cascadable power residue arithmetic modules that 
25 can perform power residue computations at a multiplication precision of 2™^ and y numbers of those with a division pre- 
cision of 2 2x(m/y) . The power consumption and computation time can be optimized in such a case by cascading only 
y1 ^ 2 n1 /2 m/ y numbers of the power residue arithmetic modules that is sufficient for performing encryption and decryp- 
tion at a 2 n1 -bit precision, out of the total number of power residue arithmetic modules that are connected together. 
[0213] This second embodiment of the invention also makes it possible to achieve a precision of 512 to 1024 bits, 
30 using only single chips with a gate count of approximately 12,000 gates, in which arbitrary-precision dividers have a 
128-bit precision and implement divisions by distributed processing and arbitrary-precision multiplier have a 64-bit pre- 
cision, by cascading 8 to 16 of these multiplication/division units (chips). Higher precisions can be implemented simply, 
in a similar manner. 

[0214] Since it is possible to execute computations at a precision that is proportional to the number of clock pulses 
35 (such as 512 clock pulses for a 512-bit precision) and conf igure all of the hardware on a board, higher speeds can be 
achieved. 

[021 5] It is also possible to construct an encryption/decryption system in a simple manner on each personal computer 
system, and thus it is possible to implement an encryption/decryption system that is suitable from the information secu- 
rity viewpoint. 

40 [0216] The description above concerned a case in which a maximum of x power residue arithmetic units were pro- 
vided, but rf the power residue arithmetic units are configured of thin film transistors (TFTs), by way of example, and _ 
damage or another cause has made one transistor inoperative so that one power residue arithmetic unit cannot be 
used, this would mean that none of the power residue arithmetic units can be used, which will reduce the effective yield 
and lead to an increase in fabrication costs. 

45 [0217] For a yield of power residue arithmetic units of A in such a case, it is preferable to use a redundant configura- 
tion such that at least K (where x £ K/A ) power residue arithmetic units are provided in advance and x numbers of 
these K power residue arithmetic units that are operative are connected together. If one of the power residue arithmetic 
modules in this configuration is found to be inoperative, a redundant power residue arithmetic module can be switched 
into the connection and the corresponding power residue arithmetic unit can be handled as a operative unit. 

so [0218] It should be obvious that this applies not only to power residue arithmetic units, but also to the power residue 
arithmetic modules making up each of these power residue arithmetic units. In other words, for a yield of power residue 
arithmetic modules of A', a redundant configuration is employed in which at least L (where y £ UA*) power residue 
arithmetic modules are provided in advance and y numbers of these L power residue arithmetic modules that are oper- 
ative are connected together. Thus, if one of the power residue arithmetic modules in this configuration is found to be 

55 inoperative, a redundant power residue arithmetic module can be switched into the connection and the corresponding 
power residue arithmetic unit can be handled as a operative unit. 

[021 9] The description above concerned applications in which RSA encryption was used as the public-key cryptosys- 
tem, but it is also possible to utilize elliptical-curve encryption in a similar manner. 
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Third Embodiment 

[0220] In the above described second embodiment, all of the power residue arithmetic devices were provided as 
external additions to a personal computer system, but this third embodiment concerns a configuration in which some of 
5 the power residue arithmetic devices are installed beforehand in the personal computer system, an expansion slot for 
power residue arithmetic units for expanding the precision is also provided in the personal computer system, and the 
simply arithmetic precision can be expanded by inserting power residue arithmetic units into the expansion slot, in a 
similar manner to expanding memory by using a memory expansion slot. 

[0221] More specifically, a motherboard of the personal computer system 1 1 shown in Fig. 4 is provided with internal 
w power residue arithmetic units capable of performing power residue computations at a multiplication precision of 2 m1 
bits (where ml is a natural number and is fixed) and a division precision of 2 2xm1 bits, together with an expansion slot 
for power residue arithmetic units that acts as cascade connection terminals for cascading external power residue arith- 
metic units capable of performing power residue computations at a multiplication precision of 2 m2 bits (where m2 is a 
natural number and is fixed) and a division precision of 2 2xm2 bits. External power residue arithmetic units are con- 
15 nected to the expansion slot for power residue arithmetic units in a manner such that the following inequality is satisfied: 

z £(2 n - 2 m1 )/2 1712 (where n and z are natural numbers and are fixed). 

[0222] As a result, it is possible to perform encryption and decryption in a simple manner at 2 n -bit precision. These 
20 external power residue arithmetic units can also be modified in various ways, as previously described with respect to 
the second embodiment. 

[0223] The descriptions of the above embodiments dealt with cases in which the present invention is implemented by 
personal computers, but it can also be applied in a similar manner to workstations, measuring equipment, and comput- 
ers incorporated into equipment such as domestic appliances. 

25 

Fourth Embodiment 

[0224] The description now turns to optical transmission within a single chip, as mentioned in the first to third embod- 
iments. 

30 

Outline of Microcomputer: 

[0225] A block diagram of an example of a microcomputer 100 that is an information processing system to which the 
present invention is applied is shown in Fig. 17. 

35 [0226] The microcomputer 100 shown in Fig. 17 comprises the various functional units described below. That is to 
say, the interior of the microcomputer 100 is provided with a central processing unit (CPU) 102; read-only memory 
(ROM) 104; random access memory (RAM) 106 used as cache memory; a high-frequency oscillation circuit 108; a low- 
frequency oscillation circuit 1 10; a reset circuit 1 12; a prescaler 1 14; a timer circuit comprising a 16-bit programmable 
timer 1 16, a 8-bit programmable timer 118. and a clock timer 120; data transmission control circuitry comprising an 

40 intelligent direct memory access controller (DMAC) 122 and a high-speed DMAC 124; an interrupt controller 126; a 
serial interface 128; a bus control unit (BCU) 130; an analog interface circuit comprising an analog-digital (A/D) con- 
verter 132 and a digital-analog (D/A) converter 134; and I/O circuitry comprising an input port 136, an output port 138, 
and an input-output (I/O) port 140. In addition, the microcomputer 100 comprises an internal bus line 146 comprising a 
data bus 1 42 and an address bus 1 44, that connects the CPU 1 02 to the other functional units 1 04 to 1 40, and various 

45 terminals. All of these components are formed on a single semiconductor substrate. Note that the data bus 1 42 and the 
address bus 144 are shown separately in Fig. 17, but a single line could be used as a common data/address bus, for 
transmitting both data and addresses, if the data and addresses are transmitted in a time division manner. 
[0227] The internal bus line 146 shown in Fig. 17 is connected to the internal bus 4 of Fig. 2, whereby all of the above 
components are connected to a programmable digital processor 3. 

50 

Transmission of Optical Signals between Functional Units: 

[0228] The present embodiment is characterized in that signals transmitted by the data bus 1 42, the address bus 1 44, 
and the internal bus 4 are optical signals. In other words, multi-channel data such as 32-bit data is handled as optical 
55 signals of different wavelengths which are transmitted simultaneously in an optical manner within the data bus 142 
formed as an optical transmission medium. Similarly, address signals are handled as optical signals of different wave- 
lengths which are transmitted simultaneously in an optical manner within the address bus 144. Note that if a single opti- 
cal transmission medium is used in common for the data bus 142 and the address bus 144, data and addresses in the 
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optical signal are transmitted in a time division manner. 

[0229] In such a case, all of the above described functional units 102 to 140 operate in accordance with multi-channel 
electrical signals to implement their various functions, as in the conventional art, and thus can be formed as semicon- 
ductor components or the like. To that end, each of the functional units 102 to 140 is provided with one or both of a sig- 
s nal input section for converting multi-channel electrical signals that are input through the internal bus line 146 (the data 
bus 142 and/or the address bus 144) into multi-channel optical signals, and a signal output section for converting multi- 
channel electrical signals into multi-channel optical signals. 

[0230] Part of a substrate 200 on which is formed the microcomputer 100 is shown schematically in Fig. 18. This Fig. 
1 8 shows a configuration in which a signal is transmitted through the internal bus line 1 46 from a first functional unit 210 
io (one of the functional units 1 02 to 1 40) formed on the substrate 200 to a second functional unit 220 (another of the func- 
tional units 102 to 140). 

[0231] The first functional unit 210 comprises a electrical circuit region 212, a wiring section 214 that transmits an 
output signal (multi-channel electrical signals) from this electrical circuit region 212, and a light-emitting section 216 that 
acts as a signal output section for outputting an optical output that is multichannel optical signals having different wave- 

15 lengths based respectively on multi-channel electrical signals. 

[0232] The second functional unit 220 comprises a light-receiving section 222 that acts as a signal input section for 
converting multi-channel optical signals into multi-channel electrical signals, an amplifier circuit 224 for amplifying the 
multi-channel electrical signals, a wiring section 226 for transmitting this electrical signal, and an electrical circuit region 
228. Note that the amplifier circuit 224 functions as a level shifter for shifting the voltage level of the electrical signal from 

20 the light-receiving section 222 to the voltage level necessary for the second functional unit 220, and is provided as nec- 
essary. 

[0233] The internal bus line 146 formed between the light- emitting section 216 and the light-receiving section 222 is 
configured as a waveguide 230 that is an optical transmission medium which can transmit multi-channel optical signals 
simultaneously. 

25 [0234] To enable the light-emitting section 216 shown in Fig. 18 to convert multi-channel electrical signals into multi- 
channel optical signals, it is provided with a number of light-emitting elements corresponding to the number of channels. 
In a similar manner, the light-receiving section 222 has light-receiving elements corresponding to the number of chan- 
nels. 

[0235] Note that signals can be transmitted in both directions between the first and second functional units 210 and 
30 220. In such a case, the first functional unit 210 is provided with a light-receiving section connected optically to the 
waveguide 230 that is the optical transmission medium in Fig. 18, as well as an amplifier for amplifying an electrical sig- 
nal from that light-receiving section. Similarly, the second functional unit 220 is provided with a light-receiving section 
222 that is connected optically to the waveguide 230. 

[0236] A cross-section through an example of three light-emitting elements 216A to 216C and the waveguide 230, 
35 which is part of the light-emitting section 216 of Fig. 18, is shown in Fig. 19. Fig. 19 shows a state in which light-emitting 
elements 216A to 216C for three channels are formed on top of the common waveguide 230. 

[0237] In Fig. 1 9, the waveguide 230 is configured in a manner such that a transparent electrode layer 244 formed of 
a material such as indium tin oxide (ITO) is provided as a core between a lower Si0 2 layer 240 and an upper Si0 2 layer 
242. Note that all of the outer surfaces of the ITO layer 244 that forms the core are covered by the Si0 2 layer 240 and 

40 the Si0 2 layer 242, preventing the leakage of light therefrom. 

[0238] Since the light-emitting elements 21 6A to 21 6C have essentially the same configuration, apart from the com- - 
position and materials of some of the layers, as will be described later, the description below deals only with the light- 
emitting element 216A. The light- emitting element 216A comprises banks 250 for delimiting the light-emitting sections 
of the channels, on top of the upper Si02 layer 242. Between these banks 250 are formed an ITO layer 252 and a light- 

45 emitting layer 254 in sequence, and metal electrodes 256 (of, for example, Al-Li) are formed to cover parts of the light- 
emitting layer 254 and the banks 250. Note that a optical filter that allows the passage of only wavelengths within a nar- 
row band could be provided below the ITO layer 252. In this manner, a plurality of light-emitting elements 216A to 216C 
and so on, which are optically isolated by the banks 250, are formed on the waveguide 230. 

[0239] The light-emitting layer 254 of the light-emitting element 216A is formed of an organic electroluminescence 
so (EL) material, by way of example. This organic EL is formed by ejection from inkjet nozzles 258 onto the ITO layer 252 
to a thickness of approximately 0.1 urn, by way of example. By selecting the organic EL material, the wavelength of light 
emitted from the light-emitting layer 254 of the light-emitting element 21 6A of a certain channel can be made different 
from the wavelengths of light emitted from the light-emitting elements 216B, 21 6C, etc., of all the other channels. 
[0240] Using organic EL as the light- emitting layer 254 increases the freedom with which the wavelengths of the emit- 
55 ted light can be selected, and in fact it is possible to select any desired wavelength by selecting a specific material or 
compounding materials. 

[0241 ] For the organic light-emitting material, the energy of an exciter within the light-emitting material can be selected 
to be suitable for the energy difference between the highest occupied molecular orbital (HOMO) and the lowest unoc- 
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cupied molecular orbital (LUMO) corresponding to the forbidden band of an organic substance. For example, a low- 
molecular weight material, a high-molecular weight material (particularly a conjugate high-molecular weight material 
achieved by conjugation of a principal chain), conductive molecules, or pigment molecules can be selected 
[0242] therefor.; If a low-molecular weight organic material is used as the organic light-emitting material, anthracene, 
5 PPCP, ZnjfOxZk, distyrylbenzene (DSB), a derivative thereof (PESB), or the like could be used for emitting blue light. 
Similarly, BPPC, perylene, DCM, or the like could be used for emitting red light. 

[0243] If a high-molecular weight organic material is used as the organic light- emitting material, PAT or the like could 
be used for emitting red light; MEH-PPV could be used for emitting orange light; PDAF, FP-PPP, RO-PPP, PPP, or the 
like could be used for emitting blue light; and PMPS or the like could be used for emitting purple light. 
io [0244] Other materials that could be used as the organic light-emitting material include PPV, RO-PPV, CN-PPV, 
PdphQ x , PQ X , poly-N-vinylcarbazol (PVK), PPS, PNPS, and PBPS. 

[0245] In particular, PVK can be modified to give any oscillation wavelength (light- emitting color), by controlling the 
mixing density and number of ink injections of a dopant ink such as pigment molecules, which have a poor carrier trans- 
port capability in a Eu complex. The color that is generated can be adjusted by doping a fluorescent pigment into an 
is organic light- emitting material formed of PVK, by way of example. 

[0246] If the configuration is such that rhodamine B or DCM can be doped into PVK, the color that is generated can 
be modified as desired from green to red. 

[0247] In addition the wavelength of the light (the peak wavelength or wavelength band thereof) can be adjusted to a 
certain degree by an additional optical filter disposed below the ITO layer 252 of Fig. 19. 
20 [0248] If light of a wide wavelength band, such as white light, is emitted and the wavelength thereof is adjusted, an 
absorbent optical color filter can usually be used as the above described optical filter and this can be adjusted so that 
only light of a desired color (wavelength) is passed therethrough as an optical signal. 

[0249] A distributed reflective multi-layer mirror (DBR mirror) could be used as the additional optical filter disposed 
below the ITO layer 252. This DBR mirror is a stack of a plurality of thin films having different refractive indices, and, in 

25 particular, comprises a plurality of pairs of thin films having different refractive indices. The compositions of these thin 
films could be semiconductor or dielectric materials, by way of example, but dielectric materials are preferred. These 
could be formed by using a usual vacuum film-formation method or liquid-phase film-formation method. A soluble 
organic compound in an organic solvent could be used as the starting material for the dielectric material, in which case 
it is possible to apply a pattern by the inkjet method shown in Fig. 20. 

30 [0250] The light-emitting elements 216A to 216C could also be formed of vertical-cavity surface-emitting lasers. Such 
a surface-emitting laser is formed of a stack of alternate cladding layers and active layers (preferably of a quantum well 
structure) formed between two mirrors of different reflectivities, such as two distributed reflective multi-layer (DBR) mir- 
rors (in the example shown in Fig. 19, the lower mirror has a lower reflectivity), with contact layers disposed between 
each of the upper and lower mirrors and corresponding upper and lower electrodes (in the example shown in Fig. 19, 

35 the lower electrode is a transparent electrode made of a material such as ITO). 

[0251 ] Note that details of this type of surface- emitting laser have already been disclosed by the present inventors in 
Japanese Patent Application No. 10-2012415, Japanese Patent Application No. 10-201244, and Japanese Patent 
Application Laid-Open No. 7-198203, etc., so further description thereof is omitted. 

[0252] It is possible to modify the light-emitting wavelength of the surface-emitting laser of each channel by selecting 
40 the composition of a material grown by epitaxial growth, such as GaAIAs, and a plurality of surface-emitting lasers of 
different compositions could be created on top of the waveguide 230 by patterning with the inkjet method shown in Fig. 
20. 

[0253] Another example of the light-emitting element 21 6A could use a edge-emitting laser 270, as shown in Fig. 22. 
Light 274 emitted from an edge surface 272 of this edge-emitting laser 270 is propagated within a waveguide 276 

45 formed in an optical transmission medium of a material such as ITO. The wavelength of light emitted by the edge-emit- 
ting laser of each channel can be modified by selecting the composition of the construction material thereof. 
[0254] Fig. 21 shows an example of light-receiving elements 222 A to 222C for three channels provided on top of an 
extension of the waveguide 230 of Fig. 19. Since these light-receiving elements 222 A to 222C have essentially the 
same configuration, apart from the composition and materials of some of the layers, as will be described later, the 

so description below deals only with the light- receiving element 222A. This light-receiving element 222A comprises an 
optical filter 262, an ITO layer 263 acting as a transparent electrode, a first conductive semiconductor layer such as an 
n-type GaAIAs layer 264, and a second conductive semiconductor layer such as a p-type GaAIAs layer 266, formed on 
the ITO layer 244 in a region away from the upper Si02 layer 242 of the waveguide 230. 

[0255] In this case, the first and second conductive semiconductor layers 264 and 266 form a PIN photodiode. 
55 Another example of the configuration of this PIN photodiode could comprise a p-type a-SiC (p-type semiconductor), an 
i-type a-Si layer, and an n-type a-SiC layer. In such a case, a Al-Si-Cu layer could be used as the metal electrode 268. 
[0256] The optical filter 262 comprises at least two layers, such as a first optical filter that allows the passage of light 
of a wavelength greater than or equal to X1 nm and a second optical filter that allows the passage of light of a wave- 
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length less than or equal to X2 (> X1) nm, by way of example, to function as a narrow-band optical filter for passing light 
of wavelengths between X1 nm and X2 nm. Note that the configuration of the optical filter 262 could also be such that 
an additional optical filter disposed below the ITO layer 252 of Fig. 19 is used therefor. Conversely, one of the materials 
described for the : optical filter disposed additionally below the ITO layer 252 of Fig. 1 9 could be used in the optical filter 

5 262.//, 

[0257] The above configuration ensures that light of a specific wavelength that has been propagated by the waveguide 
230 and passed through the optical filter 262 is converted from light into a current at the depletion layer formed at the 
boundary between the first and second conductive semiconductor layers 264 and 266. and that electrical signal can be 
extracted through the electrodes 263 and 268. 

10 [0258] Incidentally, the wavelength of the light that passes through the optical filter 262 can be modified by a suitable 
selection of the material of which it is constructed. In addition, the wavelength of the light that can be detected by the 
photodiode formed of the first and second conductive semiconductor layers 264 and 268 can be modified by selecting 
the composition of the material thereof, such as GaAIAs. This ensures that the wavelength of light detected by the light- 
receiving element 222A of one channel is different from the wavelengths detected by the light-receiving elements 222B. 

is 222C. etc., of all the other channels. Note that the optical filter 262 and the first and second conductive semiconductor 
layers 264 and 268 could be formed by a film-formation method using the inkjet nozzles 258 shown in Fig. 20. Alterna- 
tively, the light-receiving elements 222A, 222B, and 222C that detect different wavelengths could be mounted on the 
waveguide 230. 

20 Description of Microcomputer Operation 

[0259] If the CPU 102 of Fig. 17 is to read data from the RAM 106 which is acting as cache memory, for example, 
signals such as a chip enable or chip select signal from the CPU 102 to select the RAM 106 and address signals to be 
read out are output as multi-channel electrical signals. 

25 [0260] In this case, if the first functional unit 210 of Fig. 18 is the CPU 102, the multi-channel electrical signals there- 
from are output by the electrical circuit region 21 2 of Fig. 1 8. These multi-channel electrical signals are input to the light- 
emitting section 216 through the wiring section 214. The light-emitting elements that form the light-emitting section 216 
each input one of the channels of the electrical signal through the metal electrode 256 thereof, and an optical signal of 
that channel's own wavelength is emitted by the corresponding light- emitting layer 254. This generates an optical signal 

30 that is controlled for each channel from the light-emitting layer 254, and these signals are incident on the waveguide 
230 through the ITO layer 252. The waveguide 230 propagates light through the ITO layer 244 that forms the core 
thereof, as shown in Fig. 19. 

[0261] If the second functional unit 220 of Fig. 18 is assumed to be the RAM 106, the optical signal propagated 
through the waveguide 230 is incident on the light-receiving section 222 thereof. In this case, each of the light-receiving 

35 elements 222A to 222C that make up the light-receiving section has the optical filter 262 which passes light of the cor- 
responding wavelength, so that only light of wavelength corresponding to each channel is incident on the light-receiving 
elements 222A to 222C that are disposed to correspond to the number of channels. In addition, each of the light-receiv- 
ing elements 222A to 222C is configured to detect light of the wavelength that has passed through the corresponding 
optical filter 262, so that an electrical signal for each channel is output therefrom. 

40 [0262] The RAM 106 operates electrically thereafter, in a similar manner to a conventional LSI, so that the necessary 
data is read out from the RAM 106. Electrical signals that have been read from the RAM 106 are converted by the light- - 
emitting section on the second functional unit 220 side, omitted from Fig. 18, and are propagated through the 
waveguide 230 of Fig. 18. These optical signals are converted into electrical signals by the light-receiving section on 
the first functional unit 210 side, then are supplied to the CPU 102 through an amplifier. 

45 [0263] In this manner, losses in transmission speeds can be substantially ignored, because signals are transferred 
as light between the functional units within the microcomputer 100 acting as an information processing system. This 
means that it is not necessary to design from considerations of transmission delays, even if the trend for higher levels 
of integration continues, greatly facilitating the structural design of circuitry. Moreover, the amount of heat generated by 
the microcomputer 100 can be reduced by this transfer of signals as light between the functional units within the micro- 

50 computer 100. 

[0264] Note that it is also possible to have a configuration in which optical communications are used between the 
microcomputer 100 or Fig. 17 and peripheral equipment. This peripheral equipment could be a display section such as 
an LCD or CRT, or a printer or the like. 

[0265] In such a case, the input port 136, the output port 138, and the I/O port 140 of Fig. 17 could be configured to 
55 input and output multi-channel optical signals with respect to the internal bus line 146. The configuration in this case 
could be such that an optical shutter is provided for each of these ports, to close off the path when optical signals are 
not being input or output therethrough, and this optical shutter is opened only when peripheral equipment specifies it, 
by means of a chip select signal or the like. 



23 



EP 0 974 913 A1 



Fifth Embodiment 

[0266] A fifth embodiment of the present invention relates to a system LSI wherein a plurality of functional units, which 
are integrated onto a single chip as an LSI in the art, are distributed between different semiconductor devices. 

5 [0267] In a layout shown in Fig. 23, a system LSI formed on a base substrate 306 is configured of a first semiconduc- 
tor device 300 that forms part of the microcomputer 100 of Fig. 17 and a second semiconductor device 302 that forms 
all or part of the remainder thereof, and the first and second semiconductor devices 300 and 302 are connected 
together by a external bus line 304. In addition to the waveguide 230 shown in Fig. 18, the external bus line 304 could 
be formed of an optical fiber, for example, and signals can be transmitted between the first and second semiconductor 

io devices 300 and 302 by light. Note that a plurality of functional units are disposed within each of the first semiconductor 
device 300 and the second semiconductor device 302, and signals are transmitted between the functional units within 
each semiconductor device by light, in a similar manner to the first embodiment. 

[0268] In this case, at least the RAM 1 06 of Fig. 1 7 acting as a cache memory, the ROM 104, and an internal bus line 
310 formed of an optical transmission medium are disposed within the second semiconductor device 302 of Fig. 23, 
is mounted on a second substrate 312. Similarly, the CPU 102 and the internal bus line 146 of Fig. 17, together with the 
functional units 108 to 140 that are not incorporated in the second semiconductor device 302, are provided within the 
second semiconductor device 300 of Fig. 23 and are mounted on a first substrate 308. 

[0269] This configuration makes it possible to transmit data and addresses through the internal bus line 1 46 as optical 
signals between the CPU 102 within the first semiconductor device 300 and the other functional units 108 to 140 incor- 
20 porated within the first semiconductor device 300. Similarly, data and addresses can be transmitted as optical signals 
between the CPU 1 02 within the first semiconductor device 300 and the ROM 1 04 and RAM 1 06 within the second sem- 
iconductor device 302 through the internal bus line 146 within the first semiconductor device 300, the external bus line 
304, and the internal bus line 310 within the second semiconductor device 302. 

[0270] This makes it possible to remove at least one of the functional units that have to be accommodated within the 
25 single chip with the CPU 102 in the art, and place it externally, and also makes it possible to ignore transmission delays 
between the CPU 102 and the functional units 104 and 106 that have been relocated externally. This also makes it pos- 
sible to have a lower level of integration for the first semiconductor device 300 on a single chip, improving the yield and 
also making it possible to externally attach functional units that are accessed directly by the CPU 102, such as the ROM 
104 and RAM 106. It is therefore possible to freely design additional functions to be attached as external components, 
30 while keeping the basic single-chip design concept in common. Note that the functional units incorporated within the 
second semiconductor device 302 are not limited to memory components as described above. 

Sixth Embodiment 

35 [0271] A schematic illustrative view of a sixth embodiment of the present invention is shown in Fig. 24. As shown in 
Fig. 24, the light-emitting sections, light-receiving sections, and waveguides of the present embodiment are disposed in 
a three-dimensional manner. In Fig. 24, first to fifth layers 410a to 410e are formed on a substrate 400. The relationship 
between a light-emitting section 413 and a light-receiving section 427 in the fifth layer 41 Oe is the same as the relation- 
ship between the light-emitting section 216 and the light-receiving section 222 connected by the horizontal waveguide 

40 230 of Fig. 18. The relationship between the light- emitting section 413 and a light-receiving section 424, the relationship 
between a light-emitting section 414 and light-receiving sections 422, 425, and 426, the relationship between a light- 
emitting section 412 and a light-receiving section 421, and the relationship between a light-emitting section 423 and a 
light-receiving section 411 are basically the same as the relationship between the light-emitting section 216 and the 
light-receiving section 222 of Fig. 18, except that the waveguides therebetween have vertical portions. 

45 [0272] This device is preferably fabricated as described below. First of all. the first to fifth layers 410a to 410e are 
formed on separate first to fifth substrates, respectively. The first substrate on which is formed the first layer 41 0a could 
act as the substrate 400 shown in Fig. 24, but otherwise the first layer 410a is peeled from the first substrate and is 
transferred to the substrate 400 shown in Fig. 24. Thereafter, each of the second layers 410b to 410e is peeled from 
the corresponding second to fifth substrates, then is transferred and positioned to form a stack on the substrate 400 in 

50 the sequence shown in Fig. 24. This peeling and transfer method has already been disclosed by the present inventors 
in Japanese Patent Application No. 8-225643, so further description thereof is omitted. 

[0273] This device can achieve effects similar to those of the fourth embodiment, and can also make it possible to 
obtain an even higher degree of integration. 

[0274] Note that the present invention is not limited the above described embodiments, and various modif ications can 
55 be made within the scope of the present invention. For example, the above embodiments used organic EL and semi- 
conductor lasers by way of example, but the present invention is not limited thereto and thus the configuration can also 
include inorganic EL and light-emitting diodes. In addition, the light-receiving elements are not limited to PIN photodi- 
odes, and thus other types of diode such as PN photodiodes or avalanche photodiodes, phototransistors. or photo-lumi- 
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nescent components could be used therefor. 

[0275] In addition, the microcomputer that is the information processing system of the present invention is not limited 
to having the functional units shown in Fig. 1 7; various other structures and types of functional unit could be provided 
therefor, as appropriate. In particular, if one system LSI is used as a microcomputer on a single chip, as in the fifth 
5 embodiment, some of the functional units that are necessary for a conventional microcomputer could be omitted. The 
functional units omitted from that chip could be incorporated into another semiconductor device that is connected by an 
external bus line to that chip. 

[0276] Furthermore, electronic equipment incorporating the microcomputer or system LSI of the present invention 
could be implemented favorably as any appliance that comprises a microcomputer in the art, particularly when lower 
io power consumptions are required, such as portable electronic equipment, information communications equipment, 
domestic or commercial electronic equipment, machine tools, and automobile components. 

Seventh Embodiment 

75 [0277] The description now turns to a system LSI in accordance with a seventh embodiment of the present invention, 
implemented by the application of the configuration of the second or fourth embodiment, with reference to Fig. 25. 
[0278] A system LSI 500 shown in Fig. 25 incorporates a plurality of functional blocks 501 , 502, 503,... developed by 
different manufacturers. The designs of the three functional blocks 501 , 502, and 503 shown in Fig. 25 were based on 
libraries developed independently by Company A, Company B, and Company C, respectively. Assume that this system 

20 LSI 500 is manufactured by Company C. Company C has received consent from Company A and Company B concern- 
ing intellectual property rights for the burning of the functional blocks 501 and 502, to manufacture this system LSI. 
[0279] In this case, if Company A and Company B are to calculate the number of licenses based on the number of 
system LSIs manufactured by Company C, it is necessary for Company C to provide an accurate declaration concern- 
ing the number of LSIs to be manufactured, which means that the management of these licenses is complicated for all 

25 of Companies A to C. 

[0280] This seventh embodiment relates to an improvement in the way in which the responsibility for license manage- 
ment is split between the development manufactures and the fabrication manufacturer. 

[0281] For example, the decryption key (private key) of the previously described RSA method is disclosed by license 
to each end user of the functional block 501 that was developed by Company A. This seventh embodiment is devised 
30 in a manner to ensure that only end users having the decryption key can make use of the functional block 501, from 
among all the users of the system LSI 500. 

[0282] The configuration and operation of the functional block 501 will now be described. 

[0283] The functional block 501 has a four-arithmetical-operations function area. This four-arithmetical-operations 
function area first executes at least one of encryption and decryption, when the predetermined decryption key that was 
35 given to the end user is input. After this computation is established, this functional block 501 is able to perform four-arith- 
metical-operations other than encryption and decryption. 

[0284] More specifically, the functional block 501 has the x numbers of arithmetic units 7-1 to 7-x shown in Fig. 2, 
which operate as four-arithmetical-operations functional units. Some of these units can also be used as the power res- 
idue arithmetic units of Fig. 5 or Fig. 13B. 

40 [0285] To enable the general four-arithmetical-operations function of the functional block 501 in this case, the neces- 
sary condition therefor could be specified as the deciphering of the encrypted text M^C mod n of Equation 3 given in _ 
the section describing the principle of the RSA method used by the second embodiment. In such a case, the end user 
inputs the decryption key that was acquired beforehand by license, then uses the plurality of power residue arithmetic 
units within the functional block 501 to execute the computation of Equations 4 and 5 given in the section describing the 

45 principle of the RSA method used by the second embodiment, thus decrypting the encrypted text. The result of this 
decryption enables the general four-arithmetical -operations function using the x numbers of arithmetic units 7-1 to 7-x 
of Fig. 2 provided within the functional block 501 , so that the functional block 501 can be used. 
[0286] Note that if it is too complicated to input the decryption key every time the functional block 501 is used, a stor- 
age section could be provided for holding the decryption key acquired by license. 

so [0287] Thus the functional block 501 is provided with the configuration of the first embodiment shown in Fig. 2, and 
part thereof is used for decrypting the encrypted text as described in the second embodiment. 
[0288] It should be obvious in this case that the techniques of the fourth to sixth embodiments could be used for signal 
transmissions between the functional blocks of the system LSI 500 and within each functional block, so that optical 
transmissions are used therefor. 

55 

Claims 

1. An information processing system in which computational processing is performed on input data in accordance 
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with a processing sequence, for outputting data, said information processing system comprising: 

a plurality of arithmetic units,, each computing to an arithmetic precision of 2 m bits (where m is a natural 
number); based on said processing sequence; and 
s k; plurality of cascade connection terminals for cascading said arithmetic units each other, 

.wherein, when the maximum arithmetic precision that is required during computational processing is 2 n bits 
(where n is a natural number and is fixed), x numbers of (where x is a natural number) said arithmetic units are 
cascaded in a manner such that the inequality x £ 2 n /2 m is satisfied. 

io 2. The information processing system as defined in claim 1 , wherein, 

when an arithmetic precision of 2 n1 bits (where n1 < n, and n1 is variable) is required during computational 
processing, x1 numbers of said arithmetic units are cascaded in a manner such that the inequality 
x1 £ 2 n1 /2 171 (where x1 is a natural number and is variable) is satisfied. 

15 

3. The information processing system as defined in claim 2, further comprising: 

a clock generation circuit for generating a reference clock signal to be input to said plurality of arithmetic units, 
wherein said clock generation circuit generates 2 n1 numbers of reference clock pulses during computational 
20 processing. 

4. The information processing system as defined in any one of claims 1 to 3, 
wherein optical signals are transmitted between said plurality of arithmetic units. 

25 5. The information processing system as defined in any one of claims 1 to 3, 

wherein each of said plurality of arithmetic units comprises y numbers of arithmetic modules (where y is a natural 
number and is fixed), each computing at an arithmetic precision of 2 m/ * when y numbers of said arithmetic modules 
are cascaded. 

30 6. The information processing system as defined in claim 1 , wherein: 

at least one of said plurality of arithmetic units comprises y numbers of arithmetic modules (where y is a natural 
number and is fixed), each capable of computing at an arithmetic precision of 2 m/ y; and 
when an arithmetic precision of 2 n1 bits (where n1 < n, and n1 is variable) is required during computational 
35 processing, said y numbers of arithmetic modules are cascaded in a manner such that the inequality 

y1 £ 2 n1 /2 (where yl is a natural number and is variable) is satisfied. 

7. The information processing system as defined in claim 1 , wherein: 

40 computations are executed by (x1 - 1) numbers of cascaded arithmetic units at an arithmetic precision of 2 02 

(where n2 < n1) for performing computations at a 2 n1 -bit precision; and 

y1 numbers of arithmetic modules in one other arithmetic unit, which is cascaded from said (x1 - 1) numbers 
of arithmetic units, are cascaded in a manner such that the inequality y1 £ (2 n1 - 2 n2 )/2 (where y1 <, y, and 
yl is variable) is satisfied. 

45 

8. The information processing system as defined in claim 6 or 7, further comprising: 

a clock generation circuit for generating a reference clock signal to be input to said arithmetic units, 
wherein said clock generation circuit generates 2 n1 numbers of reference clock pulses during computational 
so processing. 

9. The information processing system as defined in any one of claims 5 to 8, 

wherein optical signals are transmitted between said y numbers of arithmetic modules. 

55 10. The information processing system as defined in any one of claims 1 to 9, further comprising: 
a first storage section for storing said processing sequence. 
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11. The information processing system as defined in claim 10, further comprising: 
computation control means for controlling said plurality of arithmetic units based on said processing sequence. 

12. The information processing system as defined in claim 1 1 , 
wherein optical signals are transmitted between said plurality of arithmetic units and said computation control 
means. 

1 3. The information processing system as defined in any one of claims 1 to 1 1 . further comprising: 

a second storage section for temporarily storing an arithmetic result produced by said plurality of arithmetic 
units, wherein optical signals are transmitted between said second storage section and said plurality of arith- 
metic units. 

75 14. The information processing system as defined in any one of claims 1 to 13, 

wherein, when said input data is X and N and output data is Y, at least one of said arithmetic units executes com- 
putational processing to compute: 



10 



20 



Y = X 2 mod N. 

15. The information processing system as defined in any one of claims 1 to 14, 

wherein, when the yield of said plurality of arithmetic units is A and the total number of arithmetic units that are 
available is K, a maximum number x (where x £ K/A) of said arithmetic units that are operative are cascaded. 

25 16. The information processing system as defined in any one of claims 5 to 7, 

wherein, when the yield of said arithmetic modules is A and the total number of arithmetic modules provided in 
each of said plurality of arithmetic units is L, a maximum number y (where y ^ LVA ) of said arithmetic modules that 
are operative are cascaded. 

30 17. An information processing system in which computational processing is performed on input data in accordance 
with a processing sequence, for outputting data, said information processing system comprising: 

a plurality of internal arithmetic units, each computing to an arithmetic precision of 2 m1 bits (where ml is a nat- 
ural number and is fixed), based on said processing sequence; 
35 a plurality of external arithmetic units for computing to an arithmetic precision of 2 m2 bits (where m2 is a natural 

number and is fixed); and 

cascade connection terminals for cascading said plurality of internal arithmetic units and said plurality of exter- 
nal arithmetic units, 

wherein, when the maximum arithmetic precision required during computational processing is 2 n bits (where n 
40 is a natural number and is variable), z numbers of said external arithmetic units are cascaded in a manner such 

that the inequality z ^ (2 n - 2 m1 )/2 012 (where z is a natural number and is fixed) is satisfied. 

18. The information processing system as defined in claim 17, wherein, 

45 when an arithmetic precision of 2 n1 bits (where n1 < n, and n1 is variable) is required during computational 

processing, z1 numbers of said external arithmetic units are cascaded in a manner such that the inequality 
z1 £ (2 n1 - 2 m1 )/2 m (where z1 is a natural number and is variable) is satisfied. 

19. The information processing system as defined in claim 17 or 18, 

so wherein optical signals are transmitted between said plurality of internal arithmetic units and said plurality of exter- 
nal arithmetic units. 

20. The information processing system as defined in any one of claims 17 to 19, 

wherein each of said plurality of external arithmetic units comprises y numbers of (where y is a natural number and 
55 is fixed) arithmetic modules performing operations at an arithmetic precision of 2 m2/y when said y numbers of arith- 
metic modules are cascaded. 

21. The information processing system as defined in claim 20, 
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wherein optical signals are transmitted between said y numbers of arithmetic modules. 

22. The information processing system as defined in any one of claims 17 to 21, further comprising: 
5 a storage section for storing said processing sequence. 

23. The information processing system as defined in claim 22, further comprising: 

computation control means for controlling said plurality of internal arithmetic units and said plurality of external 
10 arithmetic units based on said processing sequence. 

24. The information processing system as defined in any one of claims 1 7 to 23, wherein, 

when said input data is X and N and output data is Y, said plurality of internal arithmetic units and said plurality 
75 of external arithmetic units execute computational processing to compute: 

Y = X 2 mod N. 

25. The information processing system as defined in any one of claims 17 to 24, 

20 wherein, when the yield of said plurality of external arithmetic units is A and the total number of external arithmetic 
units is K, a maximum number z (where z £ K/A ) of said arithmetic units that are operative are cascaded. 

26. The information processing system as defined in claim 20 or 21 , wherein, 

25 when the yield of said arithmetic modules is A* and the total number of arithmetic modules provided in each of 

said plurality of internal arithmetic units and at least one of said external arithmetic units is L, a maximum 
number y (where y £ L/A* ) of said arithmetic modules that are operative are cascaded. 

An encryption/decryption system comprising: 

a plurality of power residue arithmetic units; and 

a plurality of cascade connection terminals for cascading said plurality of power residue arithmetic units, 
wherein each of said plurality of power residue arithmetic units comprises: 

a multiplication unit for performing a multiplication at a multiplication precision of 2 m bits (where m is a natural 
number and is fixed); and 

a division unit for performing a division at a division precision of 2 2xm bits, 

wherein, when the maximum arithmetic precision of a power residue computation executed by said plurality of 
power residue arithmetic units is 2 n (where n is a natural number and is fixed), x numbers of said plurality of 
power residue arithmetic units are connected together for encryption and decryption, in a manner such that the 
following inequality is satisfied: 

x £ 2 n /2 m (where x is a natural number and is fixed). 

28. The information processing system as defined in claim 27, 

45 wherein, when an arithmetic precision of 2 n1 bits (where n1 < n, and n1 is variable) is required during computational 
processing, x1 numbers of said plurality of power residue arithmetic units are cascaded in a manner such that the 
inequality x1 ^ 2 n1 /2 m (where x1 < x and is variable) is satisfied. 

29. The encryption/decryption system as defined in claim 27 or 28, 

so wherein optical signals are transmitted between said plurality of power residue arithmetic units. 

30. The encryption/decryption system as defined in any one of claims 27 to 29, wherein: 

said multiplication unit comprises: 
55 y numbers of multiplication modules for performing multiplications at a multiplication precision of 2™^ (where y 

is a natural number and is fixed); and 

cascade connection terminals for connecting said y numbers of multiplication modules; and 
said division unit comprises: 
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y numbers of division modules for performing divisions at a division precision of 2 x(m M ; and 
casc&de connection terminals for connecting said y numbers of division modules. 

31. The encryption/decryption system as defined in claim 30, 

s wherein optical signals are transmitted between said y numbers of multiplication modules and between said y num- 

bers of multiplication modules. 

32. Ah encryption/decryption system comprising: 

io a plurality of internal power residue arithmetic units for performing power residue computation at a multiplica- 

tion precision of 2 m1 bits (where ml is a natural number and is fixed) and a division precision of 2 2xm1 bits; 
a plurality of external power residue arithmetic units for performing power residue computation at a multiplica- 
tion precision of 2 m2 bits (where m2 is a natural number and is fixed) and a division precision of 2 2xm2 bits; and 
cascade connection terminals for cascading said pluralities of internal and external power residue arithmetic 

is units, 

wherein, when encryption and decryption is performed at a maximum bit precision of 2 n (where n is a natural 
number and is fixed), z numbers of said external power residue arithmetic units are cascaded in a manner such 
that the following inequality is satisfied: 

20 z £ (2 n - 2 m1 )/2 (where z is a natural number and is fixed). 

33. The encryption/decryption system as defined in claim 32, 

wherein, when an arithmetic precision of 2 n1 bits (where n1 <, n, and n1 is variable) is required during computational 
processing, z1 numbers of external power residue arithmetic units are cascaded in a manner such that the inequal- 
25 ity z1 £ (2 n - 2 m1 )/2 m2 (where z1 < z and is variable) is satisfied. 

34. The encryption/decryption system as defined in claim 32 or 33, 

wherein optical signals are transmitted between said plurality of internal power residue arithmetic units and said 
plurality of external power residue arithmetic units. 

30 

35. The encryption/decryption system as defined in any one of claims 32 to 34, 

wherein each of said plurality of external power residue arithmetic units comprises y numbers of external power 
residue arithmetic modules for performing power residue computations at a multiplication precision of 2™^ (where 
y is a natural number and is fixed) and a division precision of 2 2x(m/y) when said y numbers of external power res- 
35 idue arithmetic modules are cascaded. 

36. The encryption/decryption system as defined in claim 35, 

wherein optical signals are transmitted between said y numbers of external power residue multiplication modules. 

40 37. The encryption/decryption system as defined in any one of claims 32 to 36, wherein, 

when the yield of said plurality of external power residue arithmetic units is A and the total number of arithmetic 
units that are available is K, a maximum number z (where z £ K/A ) of said arithmetic units that are operative 
are cascaded. 

45 

38. The encryption/decryption system as defined in claim 35, 

wherein, when the yield of said external power residue arithmetic modules is A' and the total number of external 
power residue arithmetic modules provided for each of said plurality of external power residue arithmetic units is L, 
a maximum number y (where y £ L/A') of said arithmetic modules that are operative are cascaded. 

50 

39. An information processing system comprising a plurality of functional units, each operating based on multi-channel 
electrical signals, and a bus line for transmitting signals therebetween, wherein: 

each of said plurality of functional units comprises a signal output section and/or a signal input section; 
55 said signal output section comprises electrical -optical signal conversion circuit for converting multi-channel 

electrical signals into multi-channel optical signals of different wavelengths, and outputting the same; 
said signal input section comprises optical-electrical signal conversion cirucit for converting said multi-channel 
optical signals of different wavelengths into said multi-channel electrical signals; and 
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said bus line is formed by an optical transmission medium. 

40. The information processing system as defined in claim 39, 

wherein one of said plurality of functional units is a central processing unit (CPU), and said bus line comprises a 
s data bus line and an address bus line. 

41. The information processing system as defined in claim 40, 

wherein one optical transmission medium is used in common for said data bus line and said address bus line. 

10 42. The information processing system as defined in claim 40 or 41, 

wherein an optical input-output section for providing optical communications with peripheral equipment is con- 
nected to said bus line. 

43. The information processing system as defined in claim 42, 

is wherein said optical input-output section enables optical communications to and from said peripheral equipment, 
by a chip select signal transmitted from said central processing unit through said bus line. 

44. Electronic equipment comprising the information processing system as defined in any one of claims 39 to 43. 

20 45. A system LSI formed of a first semiconductor device and a second semiconductor device connected together by an 
external bus line, wherein: 

said first semiconductor device comprises a first signal output section and a first signal output section, and a 
central processing unit (CPU) operating based on multi-channel electrical signals is formed on a first substrate; 
25 said second semiconductor device comprises a second signal output section and a second signal input sec- 

tion, and a controlled unit that is controlled by signals from said central processing unit and operates based on 
multi-channel electrical signals is formed on a second substrate; 

each of said first and second signal output sections comprises electrical-optical signal conversion circuit for 
converting multi-channel electrical signals into multi-channel optical signals having different wavelengths, and 
30 outputting the same; 

each of said first and second signal input sections comprises optical-electrical signal conversion circuit for con- 
verting multi-channel optical signals having different wavelengths into multi-channel electrical signals; and 
said external bus line is formed by an optical transmission medium. 

35 46. The system LSI as defined in claim 45, wherein said controlled unit is cache memory. 

47. Electronic equipment comprising the system LSI as defined in claim 45 or 46. 

48. A system LSI incorporating a plurality of functional blocks developed by different manufacturers, wherein: 

40 

at least one of said plurality of functional blocks comprises a four-arithmetical -operations function area; and 
said four-arithmetical-operations function area executes computations for decryption when a predetermined 
decryption key has been input thereto, and a four-arithmetical-operations function other than said decryption 
is enabled after said decryption is established. 

45 

49. The system LSI as defined in claim 48, wherein: 
said four-arithmetical-operations function area comprises: 

a plurality of arithmetic units, each computing to an arithmetic precision of 2 m bits (where m is a natural number 
and is fixed), based on a processing sequence; and 

a plurality of cascade connection terminals for cascading said arithmetic units, 

wherein, when the maximum arithmetic precision that is required during computational processing is 2 n bits 
(where n is a natural number and is fixed), x numbers of said arithmetic units are cascaded (where x is a natural 
number and is fixed) in a manner such that the inequality x £ 2 n /2 m is satisfied. 

The system LSI as defined in claim 49, wherein: 

when an arithmetic precision of 2 n1 bits (where n1 <, n, and n1 is variable) is required during computational 
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processing, x1 numbers of said arithmetic units are cascaded in a manner such that the inequality 
x1 £ 2 n1 /2 171 (where x1 is a natural number and is variable) is satisfied. 

51. The system LSI as defined in claim 49 or 50, 

5 wherein optical signals are transmitted between said plurality of arithmetic units. 

52. The system LSI as defined in claim 49 or 50, wherein: 

a portion of said plurality of arithmetic units comprises a plurality of power residue arithmetic units that are cas- 
10 caded; 

each of said plurality of power residue arithmetic units comprises: 

a multiplication unit for performing a multiplication at a multiplication precision of 2 m bits (where m is a natural 
number and is fixed); and 

a division unit for performing a division at a division precision of 2 2xm bits; and 
is when the maximum arithmetic precision of a power residue computation executed by said plurality of power 

residue arithmetic units is 2 n (where n is a natural number and is fixed); x numbers of said plurality of power 
residue arithmetic units are connected together for decryption in a manner such that the following inequality is 
satisfied: 

20 x £ 2 n /2 m (where x is a natural number and is fixed). 

53. The system LSI as defined in claim 52, 

wherein, when an arithmetic precision of 2 n1 bits (where n1 < n, and n1 is variable) is required during computational 
processing, x1 numbers of said plurality of power residue arithmetic units are cascaded in a manner such that the 
25 inequality x1 ^ 2 n1 /2 m (where x1 <, x and is variable) is satisfied. 

54. The system LSI as defined in claim 52 or 53, 

wherein optical signals are transmitted between said plurality of power residue arithmetic units. 

30 55. Electronic equipment comprising a system LSI as defined in any one of claims 48 to 54. 
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